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1.  Study  of  Arithmetic  Properties  of 
Linear  Differential  Equations 

For  the  period  of  the  Grant  we  conducted  examinations  of  arithmetic  and  geometric 
properties  of  linear  differential  equations,  using  methods  of  Pade  approximations.  These 
studies  were  aimed  at  detection  of  special  arithmetic  properties  of  solutions  of  these  equa¬ 
tions  that  can  be  used  to  study  diophantine  approximations  of  classical  constants.  Recently, 
we  concentrated  our  work  on  examination  of  arithmetic  and  geometric  properties  of  linear 
differential  equations  that  distinguish  the  deformation  equations  (Picard-Fuchs  equations) 
from  other  linear  differential  equations.  ^ - 

About  20  years  ago  it  was  discovered  and  established  that  linear  differential  equations 
having  a  geometric  sense,  like  the  Picard-Fuchs  equations  satisfied  by  the  variation  of 
periods,  possess  strong  arithmetic  properties  (global  nilpotence,  action  of  the  Frobenius, 
Fuchsianity,...  etc).  We  refer  to  the  review  [l].  Since  then  it  was  suspected,  that  in  a 
certain  sense,  the  converse  is  true  too.  We  found  new  theoretic  and  experimental  evidence 
that  show  that  linear  differential  equations  with  arithmetic  (or  integrality)  properties  of 
their  solutions  arise  from  geometry,  or,  precisely,  correspond  to  deformations  of  period 
structure  of  algebraic  manifolds.  A  variety  of  problems  from  diophantine  geometry  arise 
here,  including  the  irrationality  and  diophantine  approximations  to  constants  of  classical 
analysis. 

We  start  with  the  definition  of  the  basic  class  of  function.  In  his  seminal  paper  [3]  on 
diophantine  approximations,  Siegel  defined  and  targeted  for  future  studies  two  classes  of 
functions  satisfying  linear  differential  equations  and  given  by  power  series  expansions  in  z, 
for  the  values  of  which  one  can  establish  general  theorems  on  irrationality,  transcendence 
and  on  the  measure  of  linear  independence.  These  two  classes  are  classes  of  E-functions 
and  G-functions,  that  command  attention  of  modern  diophantine  approximations  research. 
The  study  of  E-functions,  started  by  Siegel  [3-4],  were  significantly  advanced  since  then  by 
many  researchers,  see  particularly  [5-6].  We  would  like  to  mention  in  this  connection  that 
only  relatively  recently  we  have  proved  results  on  the  best  possible  measure  of  diophantine 
approximations  of  values  of  E-  functions  at  rational  points  [7].  These  results  present  an 
ultimate  effective  version  of  the  Schmidt  theorem  [9]  for  the  values  of  E-functions.  In 
algebraic  geometry  and  analysis,  however,  most  of  the  interesting  functions  are  analytic 
only  in  the  finite  part  of  the  complex  plane  and  have  much  better  p-  adic  convergence 
properties.  Among  these  the  G-functions  play  the  crucial  arithmetic  role. 

Definition!,!.  A  function  /(z)  with  the  expansion  at  z  =  0: 

/(*)  =  £ 

n  =  0 

is  called  a  G-function,  if  f(x)  satisfies  a  linear  differential  equation  over  Q(z),  if  coefficients 
an  are  algebraic  numbers,  and  if  there  is  a  constant  C  >  1  such  that  for  all  n  >  0  the  sizes 
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of  coefficients  an  (i.e.  the  maximum  of  absolute  values  of  an  and  all  its  conjugates)  and 
the  common  denominators  den{ao, . . . ,  a*}  are  bounded  by  Cn. 

Siegel  introduced  this  class  of  G-functions  and  put  forward  a  program  to  prove  the 
linear  independence  theorems  for  values  of  G-functions  at  algebraic  points  near  the  origin. 
Unfortunately,  Seigel  [3]  never  explicitly  proved  general  theorems  for  G-functions,  instead 
presenting  examples  of  such  theorems  and  presenting  an  outline  of  a  theory  that  could  be 
constructed  similar  to  the  theory  of  .E-functions.  Progress  in  the  later  study  of  G-functions 
becomes  heavily  dependant  on  additional  very  restrictive  global  “geometric”  conditions, 
formulated  for  the  first  time  by  Galoihkin  [9],  that  demand  that  the  G-function  property 
be  shared  by  an  expansion  of  any  other  solution  of  a  differential  equation  satisfied  by  f(x) 
at  any  algebraic  point.  We  called  these  global  conditions  in  [10],  [ll]  the  (G,  C)-conditions. 
Previously  known  results  on  G-  functions  rely  exclusively  on  these  (G,  C)-conditions,  or  on 
equivalent  ones  [9],  [12],  [13],  [11].  These  global  conditions  can  also  be  reformulated  [12]  in 
terms  of  the  p-adic  “overconvergence  at  a  generic  point”  of  solutions  of  a  linear  differential 
equation  satisfied  by  /( i). 

In  [14]  and  [15]  we  had  proved  the  general  linear  independence  results  for  values 
of  arbitrary  G-functions  at  algebraic  points  (close  to  the  origin),  without  any  additional 
conditions.  These  results  materialize  Siegel’s  program  after  some  55  years.  Also  in  [14],  [15] 
proofs  of  results  on  the  absence  of  algebraic  relations  are  presented.  It  is  more  important, 
however,  that  we  have  proved  the  strong  (G,  G)-property  for  arbitrary  G-  functions  [20]. 
This  result,  connected  with  our  study  of  the  Grothendieck  conjecture,  implies,  e.g.  that 
all  previous  results  on  G-function  theory,  proved  under  very  restrictive  conditions,  are 
unconditionally  valid  for  all  G-functions.  To  describe  our  result,  and  the  (G,  C)-function 
conditions,  one  needs  the  definition  of  the  p-curvature. 

We  consider  a  system  of  matrix  first  order  linear  differential  equations  over  Q(x), 
satisfied  by  functions  fi[x)  :  i  =  1, . . .  ,n: 


n 

dfi(z)/dx  =  ^  AiA*)fi(x)y  (1-1) 

3= i 

for  Ai'j(z)  €  Q(x)  :  i,j  =  1, . . .  n.  Rewriting  the  system  (1.1)  in  the  matrix  form 

df'/dx  =  Aff;A€Mn(Q(x)), 

one  can  introduce  the  p-curvature  operators  V/pt  associated  with  the  system  (l.l).  The 
p-curvature  operators  ¥p  are  defined  for  a  prime  p,  as 

'tp  =  {d/dx  -  A)p(modp). 

Then  is  a  linear  operator  that  can  be  represented  as  =  -Ap(modp),  where  one 
defines  for  m  >  0, 

(djdx)m  =  Am(modQ(z)[d/dx](d/dz  -  A)).  (1.2) 

Let  D{x)  be  a  polynomial  from  Z[x]  that  is  the  denominator  of  A ,  i.e.  Z?(x)AtiJ( x)  is  a 
polynomial  in  Z[x]  for  i,j  +  l,...,n.  The  (G,C)- function  condition  [10]-  [ll]  of  (1.1)  means 
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that  (1.1)  is  satisfied  by  a  system  (fi(x), . . . ,  /n(x))  of  G-functions,  and  that  there  exists 
a  constant  C2  >  1,  such  that  for  any  N ,  the  common  denominator  of  all  coefficients  of  all 
polynomial  entries  of  matrices  D(x)mArn(x)/m\  :  m  =  0, . . . ,  N,  is  growing  not  faster  than 
C2  .  With  this  conditions  is  closely  related  a  global  nilpottnct  condition  [15-18]  stating 
that  the  matrices  are  nilpotent  for  almost  all  primes  p.  The  (G,  C)-condition  implies 
the  global  nilpotence  condition. 

In  [15]  we  proved  the  global  nilpotence  (and  the  (G,C)-  function  condition)  of  linear 
differential  equations  having  a  G-function  solution.  To  prove  this  result  we  used  Pade 
approximants  of  the  second  kind. 

Theoreml.2.  Let  fi(x), . . . ,  fn(x)  be  a  system  of  G-functions,  satisfying  a  system  of 
first  order  linear  differential  equations  (1.1)  over  Q(x).  If  fi{x), . . .  /n(x)  are  linearly 
independent  over  Q(x),  then  the  system  (1.1)  satisfies  a  (G, G)-function  condition  and  is 
globally  nilpotent. Any  solution  of  (l.l)  with  algebraic  coefficients  in  Taylor  expansions  is 
a  G-function. 

Let  Ly  =  0  be  a  linear  differential  equation  of  order  n  over  Q(x)  satisfied  by  a  G- 
function  y{x),  and  y(x)  does  not  satisfy  a  linear  differential  equation  over  Q(x)  of  order 
<  n.  Then  the  equation  Ly  =  0  is  globally  nilpotent,  and  all  solutions  of  the  equation 
Ly  =  0  with  algebraic  initial  conditions  at  an  algebraic  point  x  =  xq  have  G-function 
expansions  at  x  =  zo- 

Our  main  tool  in  the  study  of  the  Grothendieck  conjecture,  and  in  the  current  study 
of  globally  nilpotent  equations  is  the  analytic  method  of  Pade,  and  more  general  algebraic 
approximations  to  functions  satisfying  nontrivial  complex  analytic  and  arithmetic  (p-adic) 
conditions.  The  corresponding  group  of  results  can  be  considered  as  a  certain  “local-global” 
principle.  According  to  this  principle,  algebraicity  of  a  function  occurs  whenever  one  has 
a  near  integrality  of  coefficients  of  power  series  expansion — local  conditions,  coupled  with 
the  assumptions  of  the  analytic  continuation  (controlled  growth)  of  an  expanded  function 
in  the  complex  plane  (or  its  Riemann  surface) — a  global,  archimcdian  condition. 

To  prove  the  algebraicity  of  an  integral  expansion  of  an  analytic  function,  only  as¬ 
sumptions  on  a  uniformization  of  this  function  have  to  be  made.  Our  results  from  [19]  and 
[20]  were  proved  in  the  multidimensional  case  as  well,  to  include  the  class  of  functions, 
uniformized  by  Jacobi’s  theta-functions  (e.g.  integrals  of  the  third  kind  on  an  arbitrary 
Riemann  surface).  Moreover,  our  result  includes  “the  nearly-integral”  expansions,  when 
the  denominators  grow  slower  than  a  typical  factorial  n!  denominator.  E.g.,  one  of  the 
results  [19-20],  shows  that  g  +  1  functions  in  g  variables  having  nearly  integral  power  series 
expansions  at  x  =  0  and  uniformized  near  x  =  0  by  meromorphic  functions  of  finite  order 
of  growth  are  algebraically  dependent. 

Among  the  applications  of  this  result  are  some  partial  but  effective  results  on  the 
Tate  conjecture  on  the  bijectivity  of  a  map  Hom(A,B)  ®  Z/  — ►  Hom(Ti(A),T/(£))  for 
Abelian  varieties  A  and  B  over  algebraic  number  fields.  The  Tate  conjecture  for  elliptic 
curves  was  left  open  by  Serre  [21]  in  the  case  when  A  and  B  have  integral  invariants  but 
no  complex  multiplication.  Finally  Faltings  [22]  proved  (ineffectively)  the  finiteness  of  the 
isogeny  classes  for  arbitrary  Abelian  varieties,  solving  the  Tate,  Schafarevich  and  Mordell 
conjectures.  We  proved  the  effective  version  of  the  Tate  conjecture  for  elliptic  curves  using 
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only  (20j  and  the  Honda  [23]  criterion  of  isogeny  of  elliptic  curves  in  terms  of  ihe  logarithms 
of  their  formal  groups  over  Fp: 

Corollaryl.3.  If  two  elliptic  curves  £i/Q  and  £2/Q  have  the  same  number  of  points 
modp  for  almost  all  p,  (or  even  for  almost  all  p  below  an  effective  bound),  then  E\j Q  and 
Ei!  Q  are  isogeneous  over  Q. 

The  demand  to  have  a  meromorphic  uniformization  of  is  a  restrictive  one,  and,  com¬ 
bined  with  nearly  integrality  condition,  'imit  us  pretty  much  to  class  of  functions  uni- 
formized  by  0-functions.  However,  results  similar  to  [19-20]  can  be  proved  for  solutions  of 
linear  differential  equations  uniformized  by  special  Fuchsian  groups.  Namely,  it  is  true  for 
functions  with  nearly  integral  coefficients  that  satisfy  linear  differential  equations,  whose 
monodromy  group  is  up  to  the  conjugation  a  subgroup  of  GLn( Q) 

We  have  applied  the  Pade  approximations  methods  used  in  diophantine  approxima¬ 
tions  and  G-functions,  to  the  Grothendieck  conjecture  that  determines  the  global  (mon¬ 
odromy)  properties  of  a  linear  differential  equation  in  terms  of  reductions  (mod  p)  of  this 
differential  equation. 

TheGrothendieckConjecture.  If  a  scalar  linear  differential  equation  of  order  n  over 
Q(x)  has  n  solutions  (mod  p)  in  Fp(x),  linearly  independent  over  Fp(xp),  for  almost  all 
prime  numbers  p,  then  this  linear  differential  equation  has  only  algebraic  function  solutions. 

Equivalently,  if  a  matrix  system  (1.1)  of  differential  equations  over  Q(x)  has  a  zero 
p-curvature  =  0  for  almost  all  p,  then  this  system  (1.1)  has  algebraic  function  solutions 
only. 

Using  our  local-global  algebracity  results,  we  solved  the  Grothendieck  conjecture  for 
large  classes  of  equations,  including  equations,  solutions  of  which  can  be  parametrized 
by  means  of  multidimensional  theta-functions.  To  the  class  of  these  equations  belong 
equations  of  rank  one  over  arbitrary  (finite)  Riemann  surfaces  [20]: 

ThtoremlA.  Any  rank  one  linear  differential  equation  over  an  algebraic  curve,  i.e.  a 
first-order  equation  with  algebraic  function  coefficients,  satisfies  the  Grothendieck  conjec¬ 
ture.  Namely,  if  T  is  an  algebraic  curve  (given  by  the  equation  Q(z,w)  =  0)  over  Q,  and 
if  the  rank  one  equation 

dF 

—  =  u(z,  w)dz  (1.3) 

over  Q(T)  (for  am  Abelian  differential  udz  on  T)  is  globally  nilpotent,  then  all  solutions  of 
(1.3)  are  algebraic  functions. 

The  relationship  of  the  p-curvature  operators  with  the  monodromy  (Galois)  group 
of  a  differential  equation  is  extremely  interesting.  Our  methods,  involving  various  gen¬ 
eralizations  of  Pade  approximations,  allow  us  to  prove  the  Grothendieck  conjecture  for  a 
larger  class  of  differential  equations,  when  additional  information  on  a  monodromy  group 
is  available.  For  example,  if  a  second  order  linear  differential  equation  has  a  commutative 
monodromy  group,  then  this  equation  satisfies  the  Grothendieck  conjecture  (the  Lame’ 
equation  with  an  integral  parameter  n  belongs  to  this  class).  A  technique  from  [27]  using 
a  random  walk  method  (in  the  free  group,  corresponding  to  the  representation  of  a  full 
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modular  group  SLi{Z)),  allowed  us  to  treat  crucially  important  class  of  equations.  The 
random  walk  technique  used  in  [27]  is  the  extension  of  the  earlier  work  [28j.  Among  the 
results  of  [27]  is  a 

Theoreml.5  Let  Ly  —  0  be  an  n-th  order  linear  differential  equation  over  Q(x)  satis¬ 
fying  the  assumptions  of  the  Grothendieck  conjecture.  If  the  monodromy  group  of  Ly  =  0 
is  up  to  a  conjugation  a  subgroup  of  GLn(Q),  then  all  solutions  of  Ly  =  0  are  algebraic 
functions. 

Our  results  on  the  Grothendieck  conjecture  are  effective  (i.e.  one  has  to  examine  only 
a  finite  set  of  primes  p),  and  can  be  used  in  various  algorithms,  including  algorithms  that 
determine  the  reduction  of  Abelian  integrals  to  elementary  functions,  see  [24], 

Various  applications  of  our  new  methods  to  the  proof  of  transcendence  of  numbers  ap¬ 
pearing  as  elements  in  the  monodromy  matrices  of  linear  differential  equations  are  possible. 
One  of  our  results  is  the  following  [28]: 

Theoreml.6.  Let  us  look  at  a  G-function  solution  (/i(x), .  • . ,  /„(x))  of  (1.1)  with 
algebraic  initial  conditions  at  a  non-singular  point  of  (1.1).  Then  its  analytic  continuations 
along  (the  basis  of)  all  possible  paths  leads  to  at  least  one  transcendental  number. 

Another  application  is  the  Matthews  problem  [29]  on  indefinite  integration  of  algebraic 
functions.  Let  T  be  a  curve  over  Q  and  D  be  any  derivation  of  a  function  field  Q(T).  Let 
/  €■  Q(r)  be  such  that  for  almost  all  p,  we  can  find  g  in  Fp(Tmodp)  such  that,  mod  p, 
/  =  Dg.  Is  it  true  then,  that 

f  =  Dg 


for  some  g  in  Q(T)?  (I.e.  if  an  integral  is  locally  algebraic  at  the  same  field,  is  it  globally 
algebraic?)  The  answer  is  yes. 

While  the  Grothendieck  conjecture  tries  to  describe  the  equations,  all  solutions  of 
which  are  nearly  integral,  it  is  more  important  to  find  out  which  equations  possess  G- 
function  solutions.  These  equations,  in  all  known  cases,  admit  the  action  of  Frobenius. 

Next,  all  evidence  points  towards  the  conjecture  that  the  globally  nilpotent  equations 
are  only  those  equations  that  are  reducible  to  Picard-Fuchs  equations  (i.e.  equations 
satisfied  by  Abelian  integrals  and  their  periods  depending  on  a  parameter). 

Our  results  on  G-functions  allow  us  to  represent  this  conjecture  even  in  a  more  fasci¬ 
nating  form,  see  [37]: 

SicgclConjecture.  Let  y(x )  =  J2's=oc^iN  be  a  G-function  (i.e.  the  sizes  of  cs  and 
the  common  denominators  of  {co, . . . ,  c.v }  grow  not  faster  than  the  geometric  progression 
in  iV) .If  y(x)  satisfies  a  linear  differential  equation  over  Q(x)  of  order  n  (but  not  of  order 
n  —  1),  then  the  corresponding  equation  is  reducible  to  Picard-Fuchs  equations.  In  this 
case  y(z)  can  be  expressed  in  terms  of  multiple  integrals  of  algebraic  functions. 

Siegel,  in  fact,  put  forward  a  conjecture  which  is,  in  a  sense,  stronger  than  the  one 
given  above.  To  formulate  Siegel’s  conjecture  we  have  to  look  again  at  his  ^-functions 
defined  in  the  same  paper  [3]  of  1929,  where  G-functions  were  defined,  and  where  Siegel’s 
theorem  on  integral  points  was  proved.  We  remind  that  a  function  /(x)  = 
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is  called  an  E-function  is  are  algebraic  numbers  and  for  any  e  >  0,  the  size  |c77j  of 
cat  and  the  common  denominator  of  {co, .  •  • ,  c//}  is  bounded  by  0[N\ *);  one  also  assumes 
that  f(x)  satisfies  a  linear  differential  equation  over  Q(x).  Siegel  showed  that  the  class  of 
E-  functions  is  a  ring  closed  under  differentiation  and  integration.  Siegel  also  studied  the 
hypergeometric  functions 

mFn(ZlzZ  |Ax) 

for  algebraic  A  /  0,  rational  parameters  and  6i,...,&»  and  m  <  n.  These 

functions  he  called  hypergeometric  E=functions  and  suggested  in  [4]  all  E-functions  can 
be  constructed  from  hypergeometric  E-functions. 

Looking  at  the  (inverse)  Laplace  transform  of  /(x),  we  see  that  Siegel’s  conjecture 
translates  into  a  conjecture  on  G-function  structure  stronger  than  Siegel’s  conjecture  given 
above.  Indeed,  it  would  seem  that  all  Picard-Fuchs  equations  might  be  expressed  in  terms 
of  generalized  hypergeometric  functions! 

This  stronger  conjecture  is  not  entirely  without  merit;  e.g.  one  can  reduce  linear 
differential  equations  over  Q(x),  satisfied  by  G-functions  to  higher  order  equations  over 
Q(*)  with  regular  singularities  at  x  =  0,/,oc  only-(like  the  generalized  hypergeometric 
ones)  cf.  [23]. 

We  are  unable  so  far  to  give  a  positive  answer  to  the  conjecture  that  all  arithmetically 
interesting  (G— functions  are  solutions  of  Picard-Fuchs  equations.  Nevertheless,  in  some 
cases  we  can  prove  that  this  conjecture  is  correct.  For  now  our  efforts  are  limited  to  the 
second  order  equations  (which  provides  with  an  extremely  rich  class  of  functions). 

It  is  very  likely  that,  at  least  for  the  second  order  equations,  the  Siegel  conjecture  can 
be  proved  mod-Jc  the  full  Grolhendieck  conjecture. 
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2.  P-adic  properties  and  P-adic  Spectrum 
of  Linear  Differential  Equations 


First  interesting  applications  of  arithmetic  studies  of  linear  differential  equations  occur 
for  second  order  equations  with  4  regular  singularities, (Heun  equations),  and,  particularly, 
for  the  Lame  equation: 


„  1  r  1 
*  +2{I  + 


1  1  .  ,  B  -  n(n  +  l)z 

— r  + - }y  +  7 - - ry  —  o 

i-l  x-a  4x(x-l  ){x-a) 


(2.1) 


(depending  on  n  and  on  accessory  parameters  J3). 

A  more  familiar  form  of  the  Lame  equation  is  the  transcendental  one  (with  the  change 
of  variables:  a  =  k~ 2,  x  =  (sn(u,&))2)  [26]: 


d?y 

du2 


4-  k2  ■  {B  -  n(n  +  l)sn2(u, k)}y  =  0 


(2.2) 


or 


dry 
du 2 


{H  -  n(n  +  l)P(u)}y  =  0. 


(2.3) 


For  the  Lame  equation  (2.3)  with  an  integral  n  defined  over  Q  (i.e.  a  6  Q  and  B  €  Q), 
our  local-global  principle  of  algebraicity  can  easily  solve  the  Grothendieck  conjecture  [22], 
[26].  We  proved  in  [22]: 


Theorem2.1.  For  integer  n  >  0  the  Lame  equation  has  zero  p  -  curvature  for  almost 
all  p  if  and  only  if  all  its  solutions  are  algebraic  functions.  The  Lame  equation  is  globally 
nilpotent  for  2n  +  1  values  of  B  :  B  =  B™  -  ends  of  lacunae  of  spectrum  of  (2.3). 

For  all  other  values  of  B ,  the  global  ni'potence  of  the  Lam£  equation  with  integral  n 
over  Q  is  equaivalent  to  the  algebraicity  of  all  solutions  of  (2.3). 

The  possibility  of  all  algebraic  solutions  of  (2.3)  with  B  ^  B™  was  shown  by  Baldassari 
(Dwork).  It  can  correspond  only  to  special  torsion  points  H  =  P{uq)  for  special  elliptic 
curves. 

For  nonintegral  n  no  simple  uniformization  of  solutions  of  Lam£  equation  exits.  More¬ 
over,  Lam4  equations  themselves  provide  the  key  to  several  interesting  uniformization  prob¬ 
lems.  An  outstanding  Lam4  equation  is  that  with  n  =  —1/2.  This  equation  determines  the 
uniformization  of  the  punctured  tori.  If  one  starts  with  a  tori  corresponding  to  an  elliptic 
curve  y2  =  /^(x),  then  the  function  inverse  to  the  automorphic  function,  uniformizing  the 
tori  arises  from  the  ratio  of  two  solutions  of  the  Lam6  equation  with  n  =  —1/2. 


*>(*)»"  + j*!(*)v'  + 


x  +  C 
16 


y  =  o, 


(2.4) 


or 


0  +  ["+j'<  •)!»- 


0. 
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If  P^ix)  =  x(x  -  l)(x  -  a)  (i.e.  the  singularities  are  at  x  =  0,  a  and  oo),  then  the 
monodromy  group  of  (2.4)  is  determined  by  3  traces  x  =  tr(AfoMi),  y  =  tr(M0Ma), 
z  —  tr(A/iA/a).  Here  M  is  a  monodromy  matrix  in  a  fixed  basis  corresponding  to  a  simple 
loop  around  the  singularity  a.  These  traces  satisfy  a  single  Fricke  identity  : 

x2  +  y2  ■+•  z2  —  xyz  =  0. 

The  (Poincare)  uniformization  case  is  that,  when  in  (2.4)  the  monodromy  group  can 
be  represented  by  real  2x2  matrices.  There  exists  a  single  value  of  the  accessory  parameter 
C  for  which  the  uniformization  takes  place. 

AlgebraicityProblem[27l.  Let  an  elliptic  curve  be  defined  over  Q  (i.e.  a  €  Q).  Is 
it  true  that  the  corresponding  (uniformizing)  accessory  parameter  C  is  algebraic?  Is  the 
corresponding  Fuchsian  group  a  subgroup  of  GL^iQ)  (i.e.  x, y  and  z  are  algebraic)? 
Conversely,  if  x,y  and  z  are  algebraic  corresponding  to  a  Fuchsian  subgroup  of  SL2(Q). 
is  it  true  that  the  uniformized  tori  is  defined  over  Q? 

Extensive  multiprecision  computations,  we  first  reported  in  [27],  of  accessory  param¬ 
eters  showed  rather  bleak  prospect  for  algebraicity  in  the  accessory  parameter  problems. 

Namely,  as  it  emerged,  there  are  only  4  (classes  of  isomorphisms  of)  elliptic  curves 
defined  over  Q,  for  which  the  values  of  uniformizing  accessory  parameters  are  algebraic. 
These  4  classes  of  algebraic  curves  are  displayed  below  in  view  of  their  arithmetic  impor¬ 
tance. 

We  want  to  explain  here  that  the  attention  to  the  arithmetic  properties  of  the  Lame 
equation  with  n  =  -1/2  arose  shortly  after  Apery’s  proof  of  the  irrationality  of  f(2)  and 
$•(3).  His  proof,  1978,  was  soon  translated  into  assertions  of  integrality  of  power  series 
expansions  of  certain  linear  differential  equations. 

To  look  at  these  differential  equations  we  will  make  use  of  the  classical  equivalence 
between  the  punctured  tori  problem  and  that  of  4  punctures  on  the  Riemann  sphere.  For 
differential  equations  this  means  Halphen’s  algebraic  transformation  between  the  Lame 
equation  *-.4)  with  n  =  -1/2  for  P{x)  =  x(x  -  l)(x  —  a),  and  the  Heun  equation  with 
zero-differences  of  exponents  at  all  singularities: 

P(x)y"  +  P'{x)y' +  [x  +  H)y  =  0.  (2.5) 

C  =  4H  +  (1  +  o). 

We  have  already  stated  that  there  are  4  Lam4  equations  with  n  =  -1/2  (up  to  Mobius 
transformations)  for  which  the  value  of  the  accessory  parameter  is  known  explicitly  and  is 
algebraic.  These  are  4  cases  when  the  Fricke  equation 

x2  +  y2  +  z2  =  xyz, 

with  0<x<y<x<  xy/2,  has  solutions,  whose  squares  are  integers.  It  is  in  these  4  cases, 
when  the  corresponding  Fuchsian  group  uniformizing  the  punctured  tori  is  the  congruence 
(arithmetic)  subgroup,  see  references  in  [23],  [27]. 

Let  us  look  at  these  4  cases,  writing  down  the  corresponding  equation  (2.5): 
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1)  x(x 2  -  l)y"  +  (3x2  -  l)y'  +  xy  =  0. 

2)  x(x2  +  3x  +  3)y"  +  (3x2  +  6x  4-  3)y'  4  (x  4  l)y  =  0. 

3)  x(x  -  l)(x  +  8)y"  4  (3x2  -  14x  -  8)y'  4  (x  4  2)y  =  0. 

4)  x(x2  4-  llx  -  l)y"  4-  {3x2  4  22x  -  l)y'  4  (x  4  3)y  =  0. 

Each  of  the  equations  l)-4)  is  a  pull-back  of  a  hypergeometric  function  by  a  rational 
map.  In  fact,  for  each  of  the  equations  l)-4)  there  exist  an  integral  power  series  y(x)  = 
£2“_0c//xN  satisfying  Ly  =  0  and  regular  at  x  =  0. 

Apery’s  example  for  f(2)  arises  from  the  equation  4)  and  from  a  solution  of  a  non- 
homogeneous  equation  Ly  =  const  ^  0  x(x)  =  Y1<n= regular  at  x  =  0  with  nearly 
integral  d jv  (this  is  according  to  the  global  nilpotence  of  the  corresponding  L).  Then  ex¬ 
plicit  computations  show  that  cx/d^  provide  with  the  dense  system  of  approximations  to 
f(2)  showing  the  irrationality  of  f(2).  Similarly,  approximations  to  f(3)  arise  from  the  the 
symmetric  square  of  equation  3). 

These  examples  lead  to  a  method  of  the  construction  of  sequences  of  dense  approxi¬ 
mation  to  numbers  using  nearly  integral  solutions  of  globally  nilpotent  equations.  Often 
the  corresponding  equations  are  Picard-Fuchs  equations  satisfied  by  generating  functions 
of  Pade  approximants  to  solutions  of  special  linear  differentia)  eolations,  see  examples  in 
[28],  [29].  ' 

Diophantine  approximations  suggest  the  following  problem:  determine  all  cases  of 
global  nilpotence  of  Lame  equations. 

Our  intensive  numerical  experiments  reveal  predictable,  phenomenon:  it  seems  that, 
with  the  exception  of  equations  l)-4)  (and  all  equations  equivalent  to  them  via  M5bius 
transformations),  there  is  no  Lame  equations  with  n  =  —1/2  over  Q  that  are  globally 
nilpotent.  We  put  these  observations  as  a 

Conjccturc2.2.  Lam^  equations  with  n  =  — 1/2  defined  over  Q  are  not  globally  nilpo¬ 
tent  except  for  4  classes  of  equations  corresponding  to  the  congruence  subgroups,  with 
representatives  of  each  class  given  by  l)-4). 

What  are  our  grounds  for  this  conjecture?  First  of  all,  Pade  approximation  technique 
allows  us  to  prove  one  positive  result  in  the  direction  of  this  conjecture  for  the  n  =  —1/2 
case  of  the  Lame  equation. 

Theorem2.3.  For  fixed  o6Q(a/0,l),  there  are  only  finitely  many  algebraic  numbers 
C  of  bounded  degree  d  such  that  the  Lame  equation  with  n  =  -1/2  is  globally  nilpotent. 

Of  course,  one  wants  a  more  specific  answer  for  any  other  n  (e.g.  for  n  =  —1/2, 
there  are  only  4  classes  of  a  and  C  given  above  with  the  global  nilpotence).  However  for 
half-integral  n,  there  are  always  n  -  1/2  trivial  cases  of  global  nilpotence,  where  solutions 
are  expressed  in  terms  of  elliptic  integrals. 

We  have  started  the  study  of  globally  nilpotent  Lam6  equations  (2.4)  or  (2.5)  with  nu¬ 
merical  experiments.  This  ultimately  led  to  Conjecture  2.2.  We  checked  possible  equations 
of  the  form  (2.5)  with 

P{x)  =  x(x2  -  OiX-f-a), 
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i.e.  4  singularities  at  x  =  0,  oo  and  2  other  points,  for  values  of 

fli, a  6  Z 

in  the  box:  |a|,|ai|  <  200. 

For  all  these  equations  (2.5)  we  checked  their  p-curvature  for  the  first  500  primes.  Our 
results  clearly  show  that  with  an  exception  of  4  classes  of  equations  equivalent  to  1  )-4) , 
any  other  equation  has  a  large  proportion  of  primes  p  such  that  the  p-curvature  is  not 
nilpotent  for  any  H  €  Q! 

An  interesting  p-adic  problem  arises  when,  instead  of  globally  nilpotent  equations  one 
looks  at  the  nilpotence  conditions  of  p-curvature  for  a  fixed  p  or,  equivalently,  when  there 
is  a  p-integral  solution  for  a  fixed  p.  The  only  known  case  (Tate-Deligne) ,  corresponds 
to  Lame  equation  with  n  =  0,  where  the  unit  root  of  the  f -function  of  the  corresponding 
elliptic  curve  is  expressed  in  terms  of  a  unique  accessory  parameter.  This  example  suggests 
a  definition  of  p-adic  spectrum,  which  we  use  only  for  Lame  equations. 

We  are  interested  in  those  Hmodp  for  which  the  p-curvature  of  (2.5)  is  nilpotent,  and 
particularly  in  those  p-  adic  HE  Zp  for  which  there  exists  a  solution  y  =  y(x)  of  (2.5) 
whose  expansion  has  p-integral  coefficients.  We  call  those  H  £  Zp,  for  which  such  y(z) 
exists,  eigenvalues  of  (2.5)  in  the  “p-adic  domain”,  and  their  set  we  call  “an  integral  p-adic 
spectrum” .  The  problem  of  study  of  arithmetic  nature  of  Lame  equation  was  proposed  by 
I.M.  Gelfand. 

To  determine  p-adic  spectrum  we  conducted  intensive  symbolic  and  numeric  compu¬ 
tations,  usint  SCRATCHPAD  (IBM),  MACSYMA  (Symbolics)  and  array  processors. 

We  start  with  the  observations  of  the  “mod  p”  spectrum  as  p  varies. 

I.  For  noncongruence  equations  (2.5)  with  rational  a  ±  0, 1  (i.e.  for  an  elliptic  curve 
defined  over  Q  with  a  point  of  order  2)  there  seem  always  to  be  infinitely  many  primes 
p  for  which  no  value  of  the  accessory  parameter  H  mod  p  gives  a  nilpotent  p-curvature 
(thus  mod  p  spectrum  is  empty). 

Sometimes  the  first  prime  p,  for  which  the  mod  p  spectrum  of  (2.5)  is  null,  occurs 
quite  far.  Here  are  a  few  statistics  for  noncongruence  equations  with  rational  integers  a: 

For  a  =  3  the  first  p’s  with  the  null  spectrum  mod  p  are:  p  =  61,  311,  677,  1699,  1783, 
1811,  2579,  2659,  3253,...  .  For  a  =  5  the  first  p’s  with  the  null  spectrum  mod  p  are:  p 
=  659,  709,  1109,  1171,  1429,  2539,  2953,  2969,  3019,  3499,  3533,  3803,  3863,  4273,  4493, 
4703,  4903,  5279,  5477,  5591,  6011,  7193,  7457,  7583,...  .  For  a  =  4  the  corresponding 
p’s  with  the  null  spectrum  are:  p  =  101,  823,  1583,  2003,  3499,...  .  For  a  =  13  the 
corresponding  list  starts  at:  p  =  1451,  1487,  2381,...  . 

Observation  I  above  was  checked  by  us  for  all  noncongruent  P(x)  =  x(i2  —  a^x  +  a) 
with  integral  oio  not  exceeding  250  in  absolute  value. 

II.  An  integral  p-adi'  spectrum  of  equations  (2.5)  with  (p-  integral)  a  has  a  complicated 
structure  depending  on  the  curve,  p-adic  spectrum  can  be  null,  finite  (typically  a  single 
element),  or  infinite,  resembling  the  Cantor  set. 

Numerical  analysis  is  not  easy  either.  For  example,  in  order  to  determine  the  3- 
adic  spectrum  with  14  digits  of  precision  (in  the  3-adic  expansion),  one  has  to  carry  out 
computations  with  over  2,000,000  decimal  digit  long  numbers! 

Example  above  show  how  globally  nilpotent  equations  can  be  used  for  interesting 
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arithmetic  applications,  particularly  irrationality  proofs.  We  suggest,  as  a  starting  equa¬ 
tion,  when  it  is  of  the  second  order,  an  equation  corresponding  to  an  arithmetic  Fuchsian 
subgroup.  Congruence  subgroups  of  T(l)  and  quaternion  groups  provide  with  interesting 
families  of  globally  nilpotent  equations. 

One  can  start  with  equations  uniformizing  punctured  tori  with  more  than  one  punc¬ 
ture.  The  complete  description  of  arithmetic  Fuchsian  groups  of  signature  (l;e)  had  been 
provided  by  Maclachlan  and  Rosenberger  [30]  and  Takeuchi  [31]. 

For  all  (1;  c)  arithmetic  subgroups  there  exists  a  corresponding  Lame  equation  with 
a  rational  n,  uniformized  by  the  corresponding  arithmetic  subgroup.  This  way  we  obtain 
70  Lame  equations,  all  defined  over  Q.  Some  of  these  equations  give  rise  to  nearly  inte¬ 
gral  sequences  satisfying  three-term  linear  recurrences  with  coefficients  that  are  quadratic 
polynomials  in  n,  and  have  the  growth  of  their  denominators  and  the  convergence  rate 
sufficient  to  prove  the  irrationality  of  numbers  arising  in  this  situation  in  a  way  similar  to 
that  of  Apery. 

Groups  of  the  signature  (l;e)  correspond  to  the  Lame  equations 

P(z)y"  +  1  over2P'{x)y'  +  {C  -  x}y  =  0 
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with  n  +  ^  =  |e. 

In  the  arithmetic  case  one  looks  at  totally  real  solutions  of  the  modified  Fricke’s 
identity,  which  now  takes  the  form: 

x1  +  y2  +  z2  -  xyz  =  2(1  -  cos(^)). 

Using  numerical  solution  of  the  (inverse)  uniformization  problem,  we  determined  the 
values  of  the  accessory  parameters.  We  display  simplest  examples: 

Here  P[x)  =  x(x  —  l)(i  —  A)  and: 

(l;2)-case: 

1)  A  =  1/2,  C  =  -3/128, 

2)  A  =  1/4,  C  =  -1/64; 

3)  A  =  3/128,  C  =  -13/211; 

4)  A  =  (2  -  y/5)2,  C  >/5)/64; 

5)  A  =  (2  -  V3)\  C  =  -(2  -  v/3)2/24; 

6)  A  =  (21>/33-27)/256, 

(l;3)-case 

1)  A  =  1/2,  C  =  -1/36 

2)  A  =  32/81,  C  =  — 31/24  •  34; 

3)  A  =  5/32,  C  =  -67/2°  ■  32; 

4)  A  =  1/81,  C  =  -1/2  -34; 

5)  A  =  (8  -  3v/7)/24, 
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3.  Continued  Fractions  and 
Applications  to  Diophantine  Approximations. 


We  start  this  brief  exposition  of  our  results  on  diophantine  approximations  with  the 
following  translation  of  our  conjectures  from  §§1-2  on  globally  nilpotent  equation  into 
classical  problem  of  nearly  integral  solutions  to  linear  recurrences. 

Problem.  Let  un  be  a  solution  of  a  linear  recurrence  of  rank  r  with  coefficients  that 
are  rational  (polynomial)  in  n: 


r—  l 

u«+r  =  ^  ^k(n)  •  Un+k 
fe=0 


for  Afc(n)  €  Q(n)  :  k  =  0,  ...,r  —  1,  and  such  that  u„  are  “nearly  integral”.  Then  the 
generating  function  of  un  is  a  function  whose  local  expansion  represents  either  an  integral 
of  an  algebraic  function  or  a  period  of  an  algebraic  integral,  i.e.  a  solution  of  Picard- 
Fuchs-like  equation. 

For  continued  fractions  this  problem  can  be  reformulated. 

Problem1.  Let  us  look  at  an  explicit  continued  fraction  expansion  with  partial  fractions 
being  rational  functions  of  indices: 

a  =  [ao’,ai,...,A(n),A(n  +  1),...], 

for  A(n)  €  Q(n).  Let  us  look  then  at  the  approximations  Pn/Qn  to  a  defined  by  this 
continued  fraction  expansion.  If  the  continued  fraction  representing  a  is  convergent  and 
for  some  e  >  0 

fr -§*■!<  I 

Vn 

n  >  n0(e),  i.e.  if  a  is  irrational,  then  the  sequences  Pn  and  Qn  of  numerators  and  denom¬ 
inators  in  the  approximations  to  a  are  arithmetically  defined  sequences;  their  generating 
functions  represent  solutions  of  Picard-Fuchs  and  generalized  Picard-Fuchs  equations. 

How  often  do  such  continued  fraction  expansions  do  occur,  apart  from  classical  cases 
known  to  Euler  (Hermite  in  the  multidimensional  case)?  One  of  the  main  purposes  of  our 
investigation  was  an  attempt  to  establish,  first  empirically,  that  there  are  only  finitely  many 
classes  of  such  continued  fraction  expansions  all  of  which  can  be  determined  explicitly. 

In  applications  to  diophantine  approximations,  a  particular  attention  was  devoted  to 
three-term  linear  recurrences  like: 

nd  •  un  =  Pd(n)  •  un_i  -  <3a(n)  •  un_2  :  n  >  2 

for  d  >  2.  Apart  from  trivial  cases  (reducible  to  generalized  hypergeometric  functions), 
our  conjectures  claim  that  for  every  d  >  1,  there  are  only  finitely  many  classes  of  such 
recurrences  that  correspond  to  deformations  of  algebraic  varieties. 
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For  d  =  2  (second  order  equations)  we  have  classified  nontrivial  three-term  recurrences 
whose  solutions  are  always  nearly  integral,  assuming  our  integrality  conjectures.  Most  of 
these  recurrences  are  useless  in  arithmetic  applications.  There  are  a  few  new  ones  that 
give  some  nontrivial  results.  Among  these  recurrences  are  the  following: 

i)  2 n2un  =  2(  — 15n2  +  20n  —  7)  •  un_i  +  (3n  —  4)2  •  un-2\ 

ii)  3n2ura  =  (-12n2  +  18n  -  7)  •  un-i  +  (2 n  -  3)2  ■  un- 2] 

iii)  n2un  =  (-12n2  +  18n  -  7)  •  un~i  +  (2 n  -  3)2  •  u„_2; 

iv)  n2  •  un  =  (56n2  —  70n  -I-  23)  •  un_x  —  (4n  —  5)2  •  un_ 2. 

There  is  a  larger  class  of  rank  r  >  2  linear  recurrences  of  the  form 

r 

«2  * u* =  ^*(*0  ■ 
k=  1 

all  solutions  of  which  are  nearly  integral.  Many  of  these  recurrences  give  rise  to  new 
irrationalities.  E.g.  we  present  the  following  new  globally  nilpotent  equation  (r  =  3): 

4x(x3  +  16x2  +  77x  -  2)y"  4-  8(2x3  +  24x2  +  77x  -  l)y'  +  (9x2  +  70x  +  84)y  =  0. 


Recently,  studying  Lame  equations  we  discovered  new  classes  of  explicit  continued 
fraction  expansions  of  classical  (special)  functions,  related  to  problems  above.  These  con¬ 
tinued  fraction  expansions  include  many  Stieltjes-Rogers’  continued  fractions  and  are  re¬ 
lated  to  elliptic  theta-functions  [62-63]. 

Stieltjes-Rogers’  expansions  (see(33])  include  the  following  examples: 


1  1  •  22  •  3k2  3  •  42  •  5fc2  5  •  62  •  7 k2 

z2  -I-  a—  z2  +  32a—  z2  +  52a—  z2  +  7a2  —  . . . 


(3.1) 


z-  f  sn2[u,  k2)c~u*du  = 

Jo 


2  2  •  32  •  4k2  4  •  52  •  6fc2  6  •  72  •  8 k2 

z2  +  22a—  z2  +  42a—  z2  +  62a—  z2  +  82a  —  . . . 


a  =  k2  +  1. 

In  the  case  of  expansion  (3.1)  the  approximations  Pm/Qm  to  the  integral  in  the  left 
hand  side  of  are  determined  from  a  three-term  linear  recurrence  satisfied  by  Pm  and  Qm 


(2m  +  l)(2m  +  2)<0m+l(z)  =  (z  +  (2m  +  l)2a)<^m(z)  -  2m(2m  +  l)k2<t>m-i(z). 

Here  <t>m  =  Pm  or  Qm,  and  Qm  are  orthogonal  polynomials.  The  generating  function 
of  Qn  satisfy  a  Lam4  equation  in  the  algebraic  form  with  a  parameter  n  =  0. 

These  special  continued  fraction  expansions  cam  be  generalized  to  continued  fraction 
expansions  associated  with  any  Lame  equation  with  an  arbitrary  parameter  n. 

For  n  =  0  these  closed  form  expressions  represent  the  Stieltjes-Rogers  expansions.  For 
n  =  1  two  classes  of  continued  fractions  from  [27]  have  arithmetic  applications,  because  for 
three  values  of  the  accessory  parameter  H  (corresponding  to  e,-nontrivial  2nd  order  points) 
the  Lam4  equation  is  a  globally  nilpotent  one  and  we  have  p-adic  as  well  as  archimedian 
convergence  of  continued  fraction  expansions.  This  way  we  obtain  the  irrationality  and 
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bounds  on  the  measure  of  irrationality  of  some  values  of  complete  elliptic  integrals  of  the 
third  kind,  expressed  through  traces  of  the  Floquet  matrices.  Similarly,  for  an  arbitrary 
integral  n  >  1,  among  continued  fraction  expansions,  expressed  as  integrals  cf  elliptic 
^-functions,  there  are  2n  +  1  cases  of  global  nilpotence,  when  continued  fractions  have 
arithmetic  sense  and  orthgonal  polynomials  have  nearly  integral  coefficients. 

Among  new  explicit  continued  fraction  expansions  is  the  expansion  of  the  following 
function  generalizing  Stieltjes-Rogers: 

Jo  cr(u)a(u0) 

The  three-term  linear  recurrence  determining  the  J-fraction  for  the  corresponding 
orthogonal  polynomials  has  the  following  form: 

Qn{x)  =  QN-i{x)-{(l+k2)iN~l)2+x}+QN.2[xyk*iN~l)2-NiN-biN-h-{N-^). 

It 

Here  x  =  sn2(uo;fc2). 

The  more  general  J-fraction  of  the  form 


6n_i  +  x  - 


an~  1 


bn  +  x 


Wl  -r  X  ~ 


with 


an  =  k4-  n(n  +  !)•(«  +  j)(n  -  •  {(n  -  1)  •  (n  -  ^}; 


6n  =  (1  +  k 2)  •  (n  -  l)2  :  n  >  2 
is  convergent  to  the  integral  of  the  form 


/°°  n”i£[-UT-  -^-e-Z(u')udu. 

J0  0(u)0(ui) 


The  generating  function  of  the  corresponding  orthogonal  polynomials  is  expressed  in 
terms  of  solutions  of  a  Lame  equation  with  parameter  m  >  1. 

Arithmetic  applications,  particularly  to  the  determination  of  measures  of  irrationality, 
of  classical  constants  often  require  E-  or  G-function  representations.  Rapidly  convergent 
nearly  integral  power  series  expansions  are  the  most  efficient  way  to  construct  Pad6  approx¬ 
imations  and  determine  the  arithmetic  nature  of  classical  constants.  We  have  developed 
the  theory  of  new  identities  giving  such  representations  [32].  They  generalize  Ramanujan’s 
quasiperiod  relations  (34]  that  give  generalized  hypergeometri  series  of  multiple  of  l/n.  To 
introduce  Ramanujan’s  series  we  first  need  Eisenstein’s  series: 


Ek{r)  =  1  -  ■  qT 


Bk 


n=  1 
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for  <Tfc-i(n)  =  l,  and  q  =  e2niT.  In  the  Ek(r)  notations,  the  quasiperiod  relation 

is  expressed  by  means  of  the  function 


S’W  =  EM  '  (^W  -  JtaM1' 


(3.2) 


which  is  nonholomorphic  but  invariant  under  the  action  of  T(l). 

Ramanujan  proved  in  [34]  that  this  function  admits  algebraic  values  whenever  r  is 
imaginary  quadratic. 

The  Clausen  identity  gives  the  following  3F2-  representation  for  an  algebraic  multiple 
of  1/n,  following  from  (3.2): 


N1- 

(3n)!n!3 


1 

J(r)n 


(3.3) 


(- 2 


7T  2(d(1728  -  J(r))1/2 

Here  r  =  (l  +  \f—d)/ 2.  If  h{—d )  =  1,  then  the  second  factor  in  the  right  hand  side  is  a 
rational  number.  The  largest  one  class  discriminant  —  d  =  —163  gives  the  most  rapidly 
convergent  series  among  those  series  where  all  numbers  in  the  left  side  are  rational : 


Vfc  +  n\  (6n)!  (-1)"  _  io*u,^ur'- _ ^ 

1  (3n)!n!3  (640, 320)”  163  •  8  -  27  •  7  •  11  •  19  •  127  7 r' 


(640, 320) 3/2 


Here 


c  1  = 


13,591,409 


163  2  -9*7  -11  -19-127 

and  J(-lti/p65)  =  -(640, 320)3. 

Ramanujan  provides  instead  of  this  a  variety  of  other  formulas  connected  mainly  with 
the  three  other  triangle  groups  commensurable  with  T(l).  All  four  classes  of  3.F2  represen¬ 
tations  of  algebraic  multiples  of  1/n  correspond  to  four  3F2  hypergeometric  functions  (that 
are  squares  of  2f'Vrepresentations  of  complete  elliptic  integrals  via  the  Clausen  identity). 
These  are 

_(6n)!  ,  *^n 

2  ’  6  ’  6  1  1  X‘ 


n= 0  v  ' 


_  ,  1  3  I  ,  V*n)1 1  1  \ 

3 4  ’  4  ’  2 ’  1  ’ 1  ’ 1  “ 


(4n)!  js 


n=0 

00 


(2n)!3 ,  x 


n=s0 


r  / 1  2  1  ,  V  (*”)'■  KW-,  f  y 

3^2'3’  3’  2’  1,1  ^  n!3  n!2  ^33-22^ 

n=0 


(3n)!  (2n)!.  x 
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Representations  similar  to  (3.3)  can  be  derived  for  any  of  these  series  for  any  singular 
moduli  r  6  Q(>/-d)  and  for  any  class  number  h(—d),  thus  extending  Ramanujan  list  ad 
infinum.  For  a  simple  recipe  to  generate  these  new  identities,  see  [32]. 

Logarithms  of  quadratic  units,  like  n,  can  be  represented  as  values  of  convergent  series 
satisfying  Fuchsian  linear  differential  equations.  This  holds  for  log  eyj  of  a  fundamental 
unit  cyjf  of  a  real  quadratic  field  Q(v/&).  To  represent  this  number  as  a  convergent  series 
(in,  say,  1/J(r))  one  uses  Kronecker’s  limit  formula  expressing  this  logarithm  loge^  in 
terms  of  products  of  values  of  Dedekind ’s  A-function  (“Jugendtraum”). 

Ramanujan’s  and  other  similar  identities  that  express  n  and  other  similar  numbers  in 
terms  of  values  of  G-functions  very  close  to  the  origin,  give  us  the  basis  of  applications  of  our 
Pade  approximation  techniques  to  these  G-functions.  In  such  applications  the  exponent 
in  the  measure  of  diophantine  approximations  strongly  depends  on  the  proximity  of  an 
evaluation  point  to  the  point  of  expansion  of  a  G-function.  To  make  this  dependence 
explicit,  we  quote  the  following  our  result  [14): 

T/ieorem3.1[l4j.  Let  fi[x), . . . ,  /n(z)  be  G-functions  satisfying  linear  differential  equa- 
tions  over  Q(x).  Let  r  =  a/6,  with  integers  a  and  6,  be  very  close  to  the  origin.  Then  we 
get  the  following  lower  bound  for  linear  forms  in  /i(r),. . .  , /„(r). 

For  arbitrary  non-zero  rational  integers  Hi, . . .  ,Hn  and  H  ~  max{[Ffi|, ...» [Hn|}>  if 
Hnfn{r)  ^  0, 

|tfl/l(r)  +  ...  +  ff„/n(r)|  >  I tfx-.-tfnr1-#1-' 
provided,  that  r  is  very  close  to  0: 


|6|>Cl>P(ri'l+e) 

and  H  >  C2  with  effective  constants  c ,  =  c,(/i, . . .  ,/n,r,e).  If  r  is  not  as  close  to  0,  we 
get  only 

+  Hnfn{r)\  >  Hx~* 

for  A  =  -(n  -  1)  log  |6j/  log  |6/an|(<  0). 

We  will  present  some  of  the  results  for  numbers  connected  with  n  based  on  effective 
Pade  approximations  with  schemes  described  in  [29],  [35].  The  first  bound  is  connected 
with  Ramanujan-like  series: 

\irV2  -p/q\  >  [9|-16-67- 
for  rational  integers  p,q  :  |?i  >  qo- 

For  7r>/3  we  use  different  system  of  Pade-type  approximations  [28].  Below  we  present 
the  corresponding  integral  representations,  leading  to  the  bound: 

l*>/3  -pjq\  >  |gj"5-791- 


for  \q\  >92. 

To  cover  large  classes  of  numbers  we  generalize  the  Ramanujan  theory  quasi-period 
relations  to  the  general  GA/-varieties.  Particularly  interesting  applications  arise  for  arith¬ 
metic  triange  (quaternion  groups  [32]).  To  present  some  examples  of  such  relations,  we 
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look  at  the  automorphic  function  0(r)  for  the  arithmetic  triangle  group  T,  normalized  by 
its  values  at  vertices. 

An  analog  of  s2 (r)  in  (3.2)  that  is  a  nonholomorphic  automorphic  form  for  T  is 

_ L_  .  ifllll _ »'  > 

<p'(r)  *  <p'[t)  Im(r)  ‘ 


For  4>(t)  =  J(r)  one  gets  s2(r). 

For  example,  let  us  look  at  a  quaternion  triangle  group  (0;3;2,6,6).  In  this  case, 
instead  of  an  elliptic  Schwarz  formula  one  has  the  following  representation  of  the  normalized 
automorphic  function  4>  =  4>(t )  in  H  in  terms  of  hypergeometric  functions: 

r  +  i(y/j  +  v5)  3W  ,r(l/3)  a  F(A,j;  |;») 
r-i[V2  +  s/3  22  •  21/6  '  y/Z  ' 

The  role  of  ?r  in  Ramanujan’s  period  relations  is  occupied  in 

(0, 3;  2,6,6)-case  by  the  transcendence  {r^1t/3^}6. 

Thus,  generalizations  of  Ramanujan  identities  allow  us  to  express  constants,  such  as  7r 
and  other  T-factors,  as  values  of  rapidly  convergent  series  with  nearly  integral  coefficients 
in  a  variety  of  ways,  with  convergence  improving  as  the  discriminant  of  the  corresponding 
singular  moduli  increases. 

Another  interesting  dimension  in  all  these  identities  is  an  ability  to  add  p-adic  inter¬ 
pretation  to  them.  We  are  developing  now  a  variety  of  p-adic  analogs  of  such  identities  to 
be  used  for  diophantine  and  for  numerical  applications. 

We  refer  also  to  our  papers  [43],  [44]  and  [Maine]  for  a  variety  of  applications  of  ef¬ 
fective  Pade  approximation  techniques  to  measures  of  diophantine  approximations.  These 
techniques,  particularly  from  [29]  and  [37]  are  based  on  multiple  integrals  of  Pochham- 
mer  type  and  the  combination  of  symbolic  and  numerical  methods  to  determine  the  best 
sequences  of  dense  approximations  to  such  numbers  as  n,  In  2,  \/[3j2. 


17 


4.Computer  Algebra  and  Numerical  Computations. 


Our  work  on  computer  algebra  systems  included  the  development  of  new  algorithms  of 
symbolic  manipulation  with  algebraic  numbers  and  functions,  solution  of  algebraic,  differ¬ 
ential  and  integral  equations,  and  development  of  algorithms  combining  computer  algebra 
and  numerical  methods.  We  refer  to  [27]  for  description  of  computer  algebra  methods  aris¬ 
ing  from  elliptic  curves  and  Abelian  varieties  with  applications  to  number  theory,  including 
diophantine  approximations,  primality  testing  and  factorization,  to  algebraic  topology,  K 
-  theory  and  superstring  interpretation,  and  to  uniformization  theory. 

Some  of  our  methods  developed  for  the  study  of  arithmetic  properties  of  linear  differ¬ 
ential  equations,  were  used  in  the  development  of  computer  algebra  systems  themselves. 
This  includes  new  methods  of  power  series  manipulation,  and,  in  particular,  fast  algo¬ 
rithms  of  power  series  expansions  of  solutions  of  differential  equations  [31],  [61].  These 
methods  also  led  to  new  algorithms  of  explicit  integration  of  algebraic  functions,  including 
the  determination  of  all  cases,  when  this  is  possible  (this  is  closely  related  to  our  solution  of 
the  Grothendieck  problem,  see  [21],  [27]).  In  the  environment  of  computer  algebra  systems 
we  were  developing  some  new  number-theoretic  algorithms,  including  new  algorithms  of 
bignum  operations,  factorization,  and  new  modular  algorithms  of  fast  convolution.  All 
this  work  was  conducted  in  the  environment  of  traditional  computer  algebra  systems,  par¬ 
ticularly  within  MACSYMA  and  SCRATCHPAD  II  environments.  Of  these  we  mainly 
benefited  from  the  impressive  capabilities  of  SCRATCHPAD  (IBM),  which  showed  itself 
capable  of  handling  very  large  computational  tasks  and  providing  with  a  very  flexible  pro¬ 
gramming  support.  Our  involvement  with  SCRATCHPAD  benefited  our  theoretical  and 
applied  work,  and  gave  us  high  hopes  for  its  future  utilization,  especially  when  it  will  be 
widely  released. 

At  the  same  time  as  we  used  the  general  purpose  computer  algebra  systems,  we  are 
developing^  specialized  packages  for  specific  tasks  and  applications.  An  acute  need  in  such 
specialized  packages  is  felt  in  the  development  of  software  support  for  handling  large  com¬ 
putational  tasks  for  vector  and  parallel  machines  based  on  modeling  or  simulation  of  realis¬ 
tic  physical  or  engineering  problems.  Among  classes  of  problems  that  we  are  interested  in, 
we  would  like  to  mention:  solution  of  boundary  value  problems,  study  of  multi-dimensional 
elliptic  problems  for  domains  of  complex  boundary  structure,  development  of  parallel  al¬ 
gorithms  for  solution  of  two-dimensional  and  three-dimensional  aero-  and  hydro-  dynamic 
problems,  and  the  study  of  astrophysical  models,  combining  chemistry,  thermodynamics, 
gravitation  and  relativistic  effects.  Development  of  vectorized  and  parallelized  codes  for 
these  problems  is  necessary  because  large  realistic  modelling  is  possible  only  on  supercom¬ 
puters  and  massively  parallel  machines,  with  GigaFlops  of  performance  and  GigaBytes  of 
storage.  The  issue  of  parallelization  and,  particularly,  intercommunication  between  many 
processors  is  an  important  and  challenging  mathematical  problem  in  itself.  It  leads  to 
a  variety  of  outstanding  questions  of  complexity,  number  theory  and  graph  theory.  We 
present  in  the  Appendix  results  of  one  of  our  works  (jointly  with  M.  M.  Denneau)  on  the 
construction  of  optimal  sparse  networks. 

We  use  computer  algebra  systems  to  develop  optimized  numerical  algorithms  for  solu- 
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tion  of  specific  equations  and  modelling  of  physical  processes.  We  also  use  computer  algebra 
systems  to  map  developed  algorithms  to  vector  or  parallel  organizations  of  large  machines 
and  supercomputers.  Both  of  these  tasks  are  so  involved  and  time  consuming  that  with¬ 
out  support  of  the  computer  algebra  environment  they  cannot  be  efficiently  accomplished. 
As  an  example  of  our  involvement,  let  us  describe  our  work  on  optimization  of  two  -and 
three-  dimensional  aero  -and  hydro-  dynamic  codes.  Part  of  this  work  started  a  few  years 
ago,  when  we  together  with  K.  Prendergast  (Dept,  of  Astrophysics,  Columbia  University), 
conducted  numerical  simulation  of  underground  explosion  in  cavities  of  varying  geometry 
for  the  Hudson  Institute  and  DOD.  The  code  was  the  so-called  beam-scheme,  quite  robust 
general  hydrodynamics  code  used  in  many  problems  of  fluid  2nd  flow  dynamics. 

Recently  we  returned  to  fluid  and  flow  problems,  when  we  started  to  look  on  ex¬ 
amples  of  large  sale  problems  that  cannot  be  solved  without  parallel  processing,  due  to 
their  numerical  complexity  and  memory  requirements.  One  of  the  best  examples  of  such 
modeling  is  provided  by  astrophysical  and  cosmological  numerical  experiments.  In  these 
experiments  the  goal  is  to  simulate  all  known  complexities  of  galaxy  formation  and  evolu¬ 
tion.  This  includes  interactions  of  stars,  interstellar  matter,  gravitational  fields,  chemistry 
and  thermodynamics  and  the  presence  of  dark  matter.  Three  -  dimensional  simulations, 
involving  large  number  of  objects  (requiring  at  least  a  million  of  grid  points)  over  a  long 
period  of  galaxy  life  are  some  of  the  most  numerically  and  memory  intensive  challenging 
tasks.  Optimization  of  numerical  algorithms  and  their  parallelization  are  necessary  in  or¬ 
der  to  tackle  these  problems  and  give  to  astrophysicists  sufficient  amount  of  data  to  study, 
and  to  develop  better  models  of  physical  processes. 

Our  current  work  involves  three  -  dimensional  multifluid  models,  combining  chem¬ 
istry,  thermodynamics,  gravitational  potential  and  global  gravitational  effects,  and  a  de¬ 
tailed  look  at  each  cell.  In  development  of  better  numerical  methods  computer  algebra 
techniques  and  tools  were  an  important  component.  The  code  itself  is  extremely  floating 
point  intensive,  because  it  involves  at  every  discrete  time  step  computations  of  a  variety 
of  elementary  and  special  functions,  including  EXP,  ERFC  (error  functions)  and  several 
complicated  integrals.  For  example,  per  grid  point  (a  million  of  them),  per  discrete  step 
(about  10s  of  them),  one  has  to  compute  with  high  aceuarcy  values  of  4  integrals  of  the 
form 

/o°°  x*  sqrtx2  +  c2 

sqrtx 2  +  e 2  dx. 

Standard  numerical  quadrature  methods  (e.g.  Gauss  -  Hermite)  and  library  calls  (sach 
as  NAG,  IMSL,  CRAY,  IBM  ESSL-V,  etc.)  make  this  part  of  computations  prohibitively 
expensive.  Using  our  methods  of  fast  computation  of  solutions  of  linear  differential  equa¬ 
tions  we  developed  with  help  of  computer  algebra  systems  a  fast  method  of  high  precision 
evaluation  of  this  and  other  similar  integrals.  Such  a  development,  combined  with  the  full 
parallelization  now  allows  to  implement  three  -  dimensional  code  on  multiprocessor  ma¬ 
chines,  and  make  the  runtime  of  the  code  tolerable  for  modeling  of  a  variety  of  initial  and 
boundary  conditions.  We  are  now  running  a  variety  of  new  codes  for  these  astrophysical 
problems  in  two  fluid  version  for  systems  with  spherical  and  cylindrical  symmetries  on  a 
variety  of  general  purpose  and  special  computers  (some  of  the  hardware  designed  by  us). 
We  are  preparing  now  for  large  runs  of  three-dimensional  codes  on  supercomputers. 
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5.  Basic  Algebraic  Complexities. 


Algebraic  complexities  were  originally  developed  to  describe  the  number  of  opera¬ 
tions:  multiplications  (multiplicative  complexity)  and  additions  (additive  complexity)  in 
traditional  algebraic  structures:  rings  of  matrices  and  rings  of  polynomials.  While  the 
operation  of  addition  in  such  rings  is  straightforward,  it  is  by  no  means  obvious  how  to 
realize  the  operation  of  multiplication  in  these  rings  in  the  least  number  of  operations  on 
primitives  (numbers),  particularly  in  the  least  number  of  multiplications. 

The  algebraic  complexity  problem  which  is  the  richest  in  its  underlying  structure  is 
the  problem  of  fast  polynomial  multiplication.  Among  other  problems  reducible  to  that 
one  can  mention:  fast  multiplications  of  multiple-precision  numbers,  gdc’s  in  polynomial 
rings,  Hankel  matrix  multiplication,  computation  of  rational  and  Pad6  approximations, 
computation  of  finite  Fourier  transform  at  ions, ...etc.  Significant  progress  in  this  problem, 
due  to  Winograd,  Fiduccia,  Strassen  and  others,  was  concentrated  mainly  on  minimal 
multiplicative  complexities  of  polynomial  multiplication  over  infinite  fields.  For  practi¬ 
cal,  particularly  hardware  implementations,  and  in  applications  to  fast  multiplication  of 
multiple  precision  numbers  all  algorithms  have  to  be  considered  over  a  ring  Z  or  Z[l/2. 

We  have  developed  new  low  complexity  algorithms  arising  from  interpolation  on  al¬ 
gebraic  curves  of  positive  genus  and  on  arbitrary  algebraic  and  arithmetic  surface.  We 
precede  the  description  of  these  algorithms  with  a  short  exposition  of  complexity  count 
and  Toom-Cook  ordinary  interpolation  method. 

The  (multiplicative)  complexity  is  well  defined  for  systems  of  bilinear  forms  [38-39]. 
Let  A  be  a  ring  (of  scalars),  and  suppose  given  s  bilinear  forms  in  variables  x  =  (i  1*  •  •  •  »  %m) 
and  y  =  (yi , . . . ,  yn)  with  coefficients  from  A: 


m  n 

*•  *  ®  i. •  •  •  .«•  (51) 

One  of  the  most  widely  used  definitions  of  the  multiplicative  complexity  of  computa¬ 
tion  of  a  system  (5.1)  over  A  is  that  of  the  range  of  the  m  x  n  x  a  tensor  (**„,,*)  over  A 
(Strassen).  A  nonzero  tensor  T  is  said  to  be  of  rank  1  over  A.  If  there  are  three  A-vectors 
(ol,...,am),(6l,...,6n),(cl,...,c.)  such  that 

U.j.k  =  '  bj  '  Cfc  for  all 

Then,  the  rank  of  a  tensor  T  over  A  is  the  minimal  number,  $a»  such  that  T  is 
expressible  as  a  sum  of  rank  1  tensors  over  A.  This  scheme  of  evaluation  of  (5.1)  is 
called  a  bilinear  scheme  (normal  or  noncommutative).  In  this  scheme  for  the  evaluation 
of  a  system  of  bilinear  forms  (5.1)  one  forms  6  products  of  linear  combinations  of  i's  and 
y's: 

wi  =  {anx !  +  ...  +  aimrm)(6/lyl  +  . . .  +  binyn ),  (5.2a) 
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/  =  1, . . .  ,6;  and  then  obtains  z's  in  (5.1)  as  linear  combinations  of  these  products: 

zk  =  Ckiu>i  +  . . .  +  ■  k  =  1, ...  ,s.  (5.26) 

In  this  definition  of  multiplicative  complexity  one  counts  only  non-scalar  (essential) 
multiplications,  i.e.  6  multiplications  in  (5.2a). 

For  future  reference,  we  express  the  bilinear  algorithm  (5.2a)-  (5.2b)  of  computation 
of  bilinear  forms  (5.1)  in  the  following  algebraic  form 


z  =  C  ■  (Ax  ®  By) 


(5.3) 


for  matrices  C  =  (ckl)(€  M.xS(A)),  A  =  (au)(e  MSxm( A)),  B  =  (6,y)(€  M«Xn(A)). 

If  one  extends  the  ring  of  scalars  by  inverting  primes  and  by  adding  algebraic  num¬ 
bers  (most  notable  roots  of  unity),  one  can  often  significantly  reduce  the  multiplicative 
complexity  of  computation  of  systems  of  bilinear  forms.  This  is  the  case  of  polynomial 
multiplications.  Unfortunately,  simultaneously  the  total  computational  complexity  of  the 
algorithms  (as  reflected  in  the  total  number  of  bit  operations)  becomes  unbearably  high. 
That  is  why  the  "least  complex”  algorithms  of  polynomial  multiplication,  developed  by 
Toom-Cook-Winograd  [38-39]  are  unattractive  in  practice  for  (relatively)  large  degrees. 
We  describe  briefly  these  algorithms  because  they  provide  clues  to  the  use  of  interpolation 
technique. 

The  basic  idea  underlying  the  Toom-Cook  method  is  to  use  interpolation  is  reconstruc¬ 
tion  of  coefficients  of  a  product  of  two  polynomials.  Namely,  let  us  consider  the  bilinear 
problem  (5.1)  corresponding  to  the  multiplication  of  two  polynomials  of  degree  m  —  1  and 
n  —  1,  respectively: 

P(<)  =  «(<!  =  (5.4) 

•=1  4=1 


uct: 


One  is  interested  in  the  efficient  determination  of  coefficients  zk  of  a  polynomial  prod- 

m+n-l 

K(i)  ='/>(<)  •  QW.  m  =  Y.  (5.5) 

k=\ 


A  "high  school”  scheme  of  evaluation  of  (5.5)  takes  mn  multiplications  (but  no  scalar 
multiplication).  Toom  noticed,  however,  that  one  can  identify  i?(t),  if  one  knows  its  values 
atm  +  n-1  distinct  points  ak  :  k  =  1, . . . , m  +  n  —  1  in  terms  of  the  Lagrange  interpolation 
formula: 


m  +  fi—  1 

*(<)=  Y 

k  =  l 


Hi^fc(t  -  ai) 

n<*fc(afc  -  aj) 


(5.6) 


Thus  if  one  assumes  the  ring  A  to  include  a,-  and  (a,-  —  Q})~1  (for  i  ^  j),  then  one 
can  reconstruct  ail  m  +  n  -  1  coefficients  zk  of  R(t)  from  R(ak)  using  scalar  multiplications 
only.  To  compute  iZ(a*)  one  needs  then  only  m  +  n  —  1  nonscalar  multiplications: 


*(ak)  =  P(a*)-Q(afc), 


(5.7) 
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where  P(c*k)  and  Q(a*)  are  computed  by  the  Horner  scheme  in  (5.4)  with  scalar  (from  A) 
multiplications  only. 

The  Toom-Cook  scheme  was  generalized  by  Winograd,  who  showed  that  one  can  eval¬ 
uate  R(t)  mod  arbitrary  relatively  prime  polynomials  Da(t),  and  then  bring  the  results 
together  using  the  Chinese  remainder  theorem  and  without  any  nonscalar  multiplications 
to  obtain  R(t)  mod  na  Da{t).  Toom-Cook  took  Da(t)  =  t  -  a,  while  Winograd  consid¬ 
ered  Da(t)  as  cyclotomic  polynomials.  Optimal  results  of  polynomial  multiplication  and 
multiplications  in  algebras  A[x]/(!T(x))  (i.e.  polynomial  multiplication  modT(x)),  when 
A  =  K  is  a  infinite  field ,  are  described  by  the  following  Winograd’s 

Theoremb.l:  Let  K  be  an  infinite  field.  Then  the  multiplicative  complexity,  6/r(m,n), 
of  multiplication  of  polynomials  of  degrees  m  -  1  and  n  -  1  over  K[x]  is  m  +  n  -  1  exactly. 
The  multiplicative  complexity,  6k{T) ,  of  multiplication  of  polynomials  in  K[x]  mod  T(x) 
for  T(x)  €  K[x\  is  equal  to  2 n  -  k ,  where  k  is  a  number  of  distinct  irreducible  factors  of 
T(x)  in  K\x\. 

Though  these  results  are  strong,  none  of  them  can  be  applied  in  practice  (starting 
from  m  >  4,  n  >  4)  because  in  any  of  the  schemes  of  minimal  multiplicative  complexity 
(over,  say,  Q),  the  scalar  multiplications  not  counted  as  “actual  multiplications”-involve 
scalars  that  are  too  large  to  be  bounded  as  just  a  few  more  additions.  Also  one  has  to 
divide  by  large  integers  having  no  advantageous  binary  structure. 

Not  only  the  large  sizes  of  scalars  make  the  number  of  additions  prohibitively  -  ex¬ 
ponentially  -  large,  but  the  coefficients  become  large  integers  requiring  fast  bignum  multi¬ 
plications.  The  total  number  of  operations,  counted  in  terms  of  single  precision  additions 
and  multiplications,  in  the  “minimal  multiplicative  complexity”  algorithms  of  Theorem 
5.1  significantly  exceeds  the  number  of  operations  in  the  standard  high  school  methods  of 
polynomial  multiplication.  This  makes  the  minimal  multiplication  complexity  algorithms 
of  Theorem  5.1  unsuitable  for  practical  implementation.  The  minimal  multiplication  rou¬ 
tines  of  Theorem  5.1  found  their  place,  though,  in  Winograd’s  short  prime  length  DFT 
algorithms;  they  are  well  suited  for  iterations  when  one  uses  them  only  for  short  data  sizes. 

It  is  more  efficient  in  practice  to  use  fast  multiplication  routines  with  as  little  divi¬ 
sion  by  scalars  as  possible  (and  no  prime  inversions!).  From  the  point  of  view  of  hardware 
realization  it  is  preferable  to  have  division  by  power  of  2  only,  i.e.  one  should  consider  min¬ 
imal  multiplication  schemes  over  A  =  Z(l/2j.  It  was  realized  some  time  ago  ([Schonhage- 
Strassen]  [41-42])  that  one  can  achieve  fast  multiplication  of  polynomials  with  division  by  2 
only,  if  one  considers  polynomial  multiplications  modulo  cyclotomic  polynomials  (x2  —  1). 
This  general  scheme,  coupled  with  fast  Fourier  transforms  in  finite  fields  (modulo  factors  of 
Fermat  numbers)  was  used  first  by  [41]  to  achieve  asymptotically  fast  multiplication  of  large 
integers.  In  that  method  one  can  multiply  two  n-bit  integers  in  time  O(nlognloglogn) 
(i.e.  in  the  many  bit-operations). 

We  see  that  linear  upper  bounds  for  multiplicative  complexities  of  polynomial  mul¬ 
tiplication  (given,  say  in  Theorem  5.1)  imply  subexponential  lower  bounds  for  additive 
complexities  of  these  algorithms.  The  best  upper  bounds  for  the  multiplicative  complexity 
of  computation  of  polynomial  multiplication  via  certain  versions  of  FFT  are  O(nlogn)  for 
polynomials  of  degrees  bounded  by  n,  even  though  divisions  by  powers  of  2  (shifts)  are 
necessary  in  this  scheme.  No  nonlinear  lower  bound  is  known  for  algebraic  complexities  in 
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this  problem,  and  the  best  total  complexity  known  is  O(nlognloglogn). 

To  see  what  can  be  the  Z-algorithms  of  fast  polynomial  multiplication,  we  have  to 
consider  first  the  reductions  mod  p,  and  to  look  at  algebraic  complexities  of  polynomial 
multiplications  of  A  =  Fp,  particularly  over  A  =  F2. 

Over  finite  fields  there  is  no  simple  answer  to  the  minimal  multiplicative  complexity 
of  polynomial  multiplication  like  one  given  in  Theorem  5.1  for  infinite  fields.  If  SA(m,n) 
denotes  the  minimal  multiplicative  complexity  of  multiplication  of  polynomials  of  degrees 
m  -  1  and  n  -  1,  respectively,  with  a  ring  of  constants  A,  then  we  always  have  Sx(m,n)  > 
m  +  n  —  1  for  an  arbitrary  field  K  of  scalars,  but  this  inequality  becomes  equality  only 
when  K  has  at  least  m  +  n  —  2  elements. 

As  it  turns  out,  lower  bounds  for  multiplicative  complexities  over  finite  fields,  and,  as 
a  consequence,  over  the  ring  A  of  scalars,  are  much  stronger  than  the  one  given  above.  To 
obtain  them,  though,  the  algebraic  theory  of  linear  codes  has  to  be  introduced. 

Let  us  recall  the  basics  of  linear  codes.  In  the  theory  of  linear  codes  one  considers 
vector  spaces  F£  of  dimension  n  over  a  finite  field  F,.  A  linear  subspace  is  called  a  linear 
code.  A  linear  code  is  the  null  space  of  a  parity  check  matrix  of  the  code,  and  a  basis  of 
the  code  form  the  rows  of  the  matrix,  called  a  generator  matrix.  In  addition  to  n,  two 
more  parameters:  k  and  d  are  associated  with  a  code  (called  an  [n,  k,  d]-code).  First,  we 
denote  by  k  the  dimension  of  the  code  over  Ffl.  Second,  by  the  weight  of  the  code,  denoted 
d,  we  understand  the  minimal  number  of  nonzero  coordinates  of  all  nonzero  vectors  from 
the  code  with  respect  to  a  fixed  basis  of  the  vector  space. 

We  will  demonstrate  the  relationship  between  (multiplicative)  complexity  of  multipli¬ 
cation  in  fc-algebras  and  the  Hamming  problem  for  linear  codes  over  k  in  the  most  general 
situation: 

Corollaryb. 2  :  If  A  is  a  F,-  algebra  of  dimension  n  over  Tq  and  without  zero  divisors, 
then  every  realization  of  multiplication  in  A  over  F,  as  a  bilinear  algorithm  with  6  = 
6f,( A)  nonscalar  multiplications  over  F,  gives  rise  to  a  [5,  n,n]-linear  code  over  F„. 

Corollary  5.2  includes,  in  particular,  all  finite  extensions  of  prime  fields. 

The  proof  of  Corollary  5.2  also  provides  important  clues  as  to  matrices  A,  B  and  C 
in  the  algebraic  form  (5.3)  of  the  algorithm  of  multiplication  in  /e-algebra  A. 

Using  Corollary  5.2  and  known  lower  bounds  from  the  theory  of  linear  codes,  one  can 
bound  from  below  the  multiplicative  complexity  of  polynomial  multiplication.  One  of  the 
best  linear  code  bounds  is  (MRRW)  proved  in  [43].  According  to  this  bound  for  q  =  2,  if 

n  —*  oo  and  there  is  a  sequence  of  [n,  ifc,dj-codes  with  d/n  —*  6,  then  R  A=  k/n  <  #2(l/2- 
-  <$2).  Here  the  entropy  function  is  Hq( x)  =  -xlog^x  -  (1  —  x)logfl(l  -  z).  Such 
bound  was  used  in  [44]  to  bound  the  multiplicative  complexity  5p}(n,m)  of  multiplication 
over  Fa  of  polynomials  of  degrees  n  -  1  and  m  -  1  respectively:  6F,(n,n)  >  3.52  •  n,  and 
<5z(n,n)  >  3.52  •  n  for  a  sufficiently  large  n. 

Our  new  results  show  that  similar  lower  bounds  hold  for  multiplicative  complexity 
of  multiplication  in  finite  extensions  of  F,.  Indeed  let  K  =  F,[t]/p(f),  for  an  irreducible 
polynomial  p{t)  of  degree  n  in  F ,[<],  so  K  ~  Fq+  ;  and  let  tfFf(,<)  denote  the  minimal 
multiplicative  complexity  in  the  field  K  over  F,.  Then  Corollary  5.2  implies  that  there 
exists  a  [£pf (K), n,  n]-linear  code  over  F,.  Thus  we  deduce  the  lower  bound 

6Fl(K)  >  3.52n  (5.8) 
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for  sufficiently  large  n  =  [K  :  F2]. 

For  q  >  2  one  gets  less  sharp  bounds  from  known  bounds  on  optimal  codes.  For 
example,  the  Plotkin  bound  implies 

<5f,(K)  >  (2  +  ■  n 


as  n  [K  :  Ffl]  — ►  00. 

We  found  an  important  phenomenon  for  polynomial  multiplication  over  finite  fields 
drastically  different  from  the  infinite  field  case.  In  the  case  of  an  infinite  field  of  constants 
k ,  Winograd’s  Theorem  5.1  shows  that  multiplication  mod  an  irreducible  polynomial  p{t) 
of  degree  n  takes  as  many  essential  multiplications  as  that  of  multiplication  of  two  poly¬ 
nomials  of  degree  n  —  1.  It  is  no  longer  true  in  the  finite  field  case,  when  we  always 
have  4(k[*]/(p(0))  +  C  -n  <  5fc(n,n)  for  a  positive  constant  C(=  C(fc))  and  an  arbitrary 
polynomial  p[t)  of  degree  n  over  a  finite  field  k.  We  conjecture,  that  in  fact 


lim 

n— *oo,'X:fc]=n 


4  (rc,rt) 

4(K) 


=  2.0. 


The  lower  bounds  for  multiplicative  complexities  of  polynomial  multiplication  are 
always  linear  in  n,  and  seem  far  from  the  best  nonlinear  upper  bound  O(nlogn)  for  finite 
field  polynomial  multiplication.  We  have  shown  that  the  upper  bound  can  be  brought 
down  to  a  linear  one  with  the  constant  comparable  to  that  of  the  lower  bound. 

To  describe  new  algorithms,  we  look  at  arbitrary  meromorphic  (rational)  functions. 
These  rational  functions,  like  polynomial  can  be  represented  in  a  variety  of  ways.  They 
can  be  represented  as  P{t)/Q(t),  in  terms  of  its  residues:  Co  +  c»/(t  -  a,),  or  by  its 

values  at  a  given  set  of  points.  The  last  representation  is  the  interpolation.  Interpolation 
formulas  are  the  basis  of  the  Toom-  Cook  algorithm  of  the  fast  polynomial  multiplication. 
Apparently  the  interpolation  algorithm  always  has  the  lowers  multiplicative  complexity. 
The  most  important  polynomial  multiplication  problem  is  that  of  multiplication  in  a  finite 
extension  K  =  fc[t]/((p(0)  f°r  a  fixed  p(t)  €  fc[t]  and  field  of  constants  k.  Let  4(^C)  be  the 
multiplicative  complexity  of  multiplication  in  K  over  k.  For  an  infinite  k  and  K  of  degree 
n  over  k,  6k(K )  =  2n  —  1  by  Theorem  5.1,  and  all  algorithms  realizing  this  multiplicative 
complexity  are  interpolation  algorithms,  interpolating  products  z(t)  •y(t)modp(f)  at  2n  —  1 
distinct  points  of  kPl.  What  happens  if  it  is  finite,  say  F2?  There  are  not  enough  points 
interpolate  at.  What  usually  is  done,  is  an  extension  of  the  field  of  constants  (till  there  are 
enough  points  for  interpolation)  and  the  nested  rule  for  multiplication  in  the  composition 
extension  of  fields.  This  is  expressed  by  a  trivial  “multiplication  rules”: 


^k(^)  <  Mfc(£)  1  Pr(^) 


(5.9) 


where  k  C  Z  C  K .  The  rules  (5.9)  are  the  basis  of  all  previously  known  and  currently  used 
fast  convolution  algorithms  (including  Nussbaumer’s  and  Schonhage-  Strassen  technique). 

We  found  a  novel  way  of  interpolation  by  means  of  representing  the  set  of  points 
where  one  interpolates  as  a  divisor  on  an  algebraic  curve.  To  represent  this  interpolation 
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in  a  more  abstract  way  one  has  to  use  the  standard  language  of  places  from  the  theory  of 
algebraic  functions  of  one  variable.  See  [45-46]. 

By  a  place  of  a  field  K  we  understand  an  isomorphism  <f>  :  K  — ►  E  U  {oo},  where  E 
is  a  field  and  </>(a)  =  oo,  <f>(b)  ^  0,  oo  for  a,  b  6  K.  There  is  an  obvious  correspondence 
between  places  and  valuations  (which  we  will  use).  All  places  on  an  algebraic  function  are 
extensions  of  places  from  corresponding  rational  function  field,  and  places  of  a  rational 
function  field  K  —  k(x)  over  a  field  d  correspond  either  to  (i)  irreducible  polynomials  p( x) 
in  k[x],  or  to  (ii)  x-1.  We  deal  only  with  algebraic  function  fields  in  one  variable.  Any 
such  field  K  over  the  field  of  constants  k  can  be  represented  in  the  form  K  =  k(x, y), 
where  x  is  (any)  transcendental  element  of  K  over  k ,  and  /,...,  yd~ 1  is  the  basis  of  K  over 
fc(x),  [K  :  lk(x)]  =  d.  For  an  arbitrary  place  ?  of  K  let  kp  be  a  field  such  that  P  is  an 
isomorphism  onto  kp  U  {oo}.  We  denote  by  vp  the  normed  valuation  with  values  in  Z, 
corresponding  to  P.  The  degree  fp  of  kp  over  k  is  called  the  degree  of  P.  A  divisor  of  K 
is  an  element  of  the  free  Abelian  group  generated  by  the  set  of  places  of  K.  The  places 
themselves  are  called  prime  divisors.  We  write  divisors  additively:  A  =  Lpvp(A) -P ,  where 
vp(A)  are  integers  among  which  only  finitely  many  are  nonxero.  A  divisor  A  is  called  an 
integral  one  if  vp(A)  >  0  for  every  P.  A  divisor  A  divides  8,  if  8  —  A  is  integral.  The 
degree  d{A)  of  a  divisor  A  is  an  integer  d(A)  =  T.pfpvp[A).  With  every  element  X  €  K 
one  associates  a  principal  division  (X)  =  E pvp(x)  •  P. 

The  most  important  object  is  the  vector  space  L(— A)  over  k  consisting  of  all  functions 
X  €  K  such  that  the  divisor  (X)  +  A  is  positive.  If  g  denotes  the  genus  of  K,  then  the 
dimension  of  L{  —  A)  over  k  is  determined  by  the  Riemann-Roch  theorem.  To  formulate  it, 
we  denote  by  C  an  equivalence  class  of  divisors  in  K  (modulo  the  principal  ones-  (X)  for 
X  €  K)y  and  by  N(C)  the  dimension  of  C,  i.e.  the  maximal  number  of  linearly  independent 
integral  divisors  in  this  class.  Then  N(C)  is  equal  to  dimL(-A)  for  any  A  from  C.  The 
Riemann-Roch  theorem  states  that 

N{C)  =d(C)-j7  +  l  +  »'(C),  (5.10) 

where  *(C)  =  0  is  the  index  of  speciality  of  C,  i(C)  =  jV(W  —  C),  where  W  is  the  class  of 
all  differentials  on  K  (canonical  class).  In  particular,  N(C)  =  d{C)  - g+  1  if  d(C )  >  2g  —  2 
or  d(C)  =  2g  —  2  and  C  ^  W . 

In  applications,  k  is  often  a  finite  field  of  characteristic  p  >  0  with  q  =  pm  elements.  If 
P  is  any  prime  divisor  of  K- an  algebraic  function  field  with  the  field  of  constants  fc-then 
the  number  of  elements  in  the  residue  field  kp  is  called  the  norm  of  P  and  is  denoted 
as:  fV(P)  =  qd^?\  This  definition  is  extended  to  all  divisors:  N(A)  =  qd^AK  With  an 
algebraic  function  field  K/k  one  can  associate  a  ?-  function: 

f(K;.)  =  £(JV(*))-,j>l 

A 

where  A  runs  over  all  integral  divisors  of  K/k. 

The  interpolation  technique  on  a  curve  corresponding  to  K/k  proceeds  as  follows.  To 
represent  a  finite  extension  kp  of  k  of  degree  d(P)  one  looks  for  a  positive  divisor  8  such 
that  the  natural  mapping 

P-.L{-B)-+kp 
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is  onto.  This  way  we  find  a  basis  of  kp  over  k  among  element  of  L{  —  S).  E.g.  if  K  =  k(t)  is  a 
rational  function  field,  and  P  corresponds  to  p(t),  one  takes  B  =  (n  — l)-oo  for  n  =  degp(t), 
i.e.  looks  for  a  power  basis  0,  l,...,fn-1  of  an  algebraic  extension  kp  =  k[t}/p(t))  of  k. 

The  multiplication  law  in  the  field  kp  over  k  can  be  then  represented  in  terms  of  the 
multiplication  rule  L(-B)  x  L(-B)  — »  L(—2  •  B).  It  remains  now  to  reconstruct  functions 
from  L(-2B)  by  interpolation.  For  this  one  could  take  a  set  D  of  divisors  of  the  first  degree 
(places  of  the  first  degree)  such  that  card(P)  >  2d(B). 

This  general  method  is  summarized  in  the  following  statements  from  [45-46J  where  for 

an  arbitrary  integral  (positive)  divisor  A  on  K,  we  put  k x  =  Kj A  =  Kmo&A. 

Corollary 5.3  :  Let  K  be  a  function  field  over  k  of  genus  g  >  0  and  let  A  be  a 
prime  divisor  of  degree  n  >  1  on  K.  Let  So  be  a  nonspecial  integral  divisor  on  K , 
i.e.  dimfcL(— So)  =  d(So)  —  g  +  1,  such  that  S  =  So  +  A  is  a  nonspecial  divisor  too.  Let 
there  be  D  prime  divisors  of  first  degree  on  K  for  D  >  2d(S).  Then  there  exists  a  bilinear 
algorithm  over  k  that  computes  the  multiplication  in  the  field  extension  k *  of  k  of  degree 
n  of  multiplicative  complexity  at  most  2d(S)  —  g  +  1  =  2n  +  2d(So)  —  g  +  1. 

Let  A  be  an  arbitrary  prime  divisor  on  K  of  degree  n,  and  kj,  be  its  residue  class 
field  which  is  an  extension  of  k  of  degree  n.  Let  us  denote  by  #K(k)  the  number  of  first 
degree  divisors  on  K  over  k.  It  follows  from  Corollary  5.3  that  whenever  there  exists  a 
nonspecial  divisor  So  on  K  of  degree  m  such  that  m  +  n  >  2g  —  1  and  #if(fc)  >  2m  +  2n, 
the  multiplicative  complexity,  £*(£*),  of  computation  of  multiplication  in  fc*  over  k ,  does 
not  exceed  2m  +  2n  —  g  +  1. 
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6.  Diophantine  Geometry  with  Applications  to 
Low  Complexity  Algorithms. 

As  results  of  §5  show,  to  construct  low  complexity  algorithm  of  multiplication  in  k- 
algebras,  associated  with  function  rings,  one  needs  a  supply  of  points  (prime  divisors  of 
the  first  degree).  Thus  the  main  problem  becomes  the  existence  of  a  large  number  of 
(appropriate)  rational  points  on  curves  of  high  genus-  a  classical  diophantine  problem. 

First  of  all  we  need  the  existence  of  some  auxiliary  divisors  for  Corollary  5.3.  For 
this  we  use  the  properties  of  the  f-function  Z(u)  =  f(A’;s),  u  =  q*.  Let  us  denote  by 
Nprimen  the  number  of  prime  divisors  on  K /k  of  degree  n.  It  follows  that  d/du  log  Z(u)  = 
£r=i  »n“l{£<f|n  -N>ri mea  •  d}.  We  obtain: 

2cr 

+  1  -  Yi  =  YL  NV Time* ' d 

*—  1  d\n 

for  any  n  >  1.  Since  |  w*  |=  y/q  for  all  *  =  1, . . .  ,2g,  we  deduce: 

a)  for  every  n>  •  log(g)  /  log  q  there  is  a  prime  divisor  of  degree  n  on  K ; 

b)  for  every  m  >  g  +  c2  \og(g  •  q)  there  exists  a  nonspecial  positive  divisor  of  degree  m 
on  K. 

In  order  to  have  low  multiplicative  complexity  algorithms  arising  from  algebraic  curves 
one  sees  that  the  remaining  problem  is  the  construction  of  algebraic  curves  over  a  fixed 
field  k  —  F,  having  the  maximal  number  of  divisors  of  the  first  degree  for  a  given  genus  g, 
as  g  — ►  oo. 

The  most  general  bound  on  the  number  #K(k)  of  points  on  K  over  k  (prime  divisors 
of  first  degree)  is  given  by  Weil’s  bound:  |  Nk{K)  —  q  -  1  |<  2gs/q.  As  g  — *  oo,  the  upper 
bound  in  the  Weil  theorem  is  unattainable.  In  fact,  relatively  simple  considerations  (cf. 
[47])  show  that 

lim  <y/g-l.  (6.1) 

g—co  g 

On  the  other  hand,  there  are  positive  results  (47-48]: 

Proposition 6.1 :  Whenever  q  is  square,  the  equality  in  (6.1)  is  attainable. 

For  example,  on  every  classical  modular  curve  of  level  m  and  genus  <7,  (m,p)  =  1, 
there  are  at  least  (p  —  l)(s  —  1)  points  over  Fpj. 

Choosing  curves  over  finite  fields  with  many  points  on  them  (Fermat  curves,  con¬ 
gruence  curves)  we  obtain  a  variety  of  new  low  multiplicative  complexity  algorithms  of 
polynomial  multiplication. 

Theorem6.2  :  Let  q  be  a  square  >  25.  Then  the  multiplicative  complexity  ^(F,*) 
of  multiplication  in  the  field  F,»  over  F,  can  be  bounded  as  follows: 

*rf(Ff.)  <  n  ■  2  •  (1  +  -^=— — )  +  o(n) 
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as  n 


— »  oo. 

For  q  =  2  we  have  the  following  upper  and  lower  linear  bounds  on  multiplicative 
complexities  (see  §4  and  the  multiplication  rule  (5.10)): 


3.52  •  n  <  <5fj(F2»)  ^  6n 


(6.2) 


for  a  large  n. 

For  F2  and  moderate  n  and  m  (below  100)  the  best  algorithm  of  polynomial  multi¬ 
plication  over  F2  has  multiplicative  complexity  of  linear  character  with  a  constant  around 
4. 

All  algorithms  of  Theorem  6.2  and  (6.2)  are  constructive  (one  can  construct  appro¬ 
priate  algebraic  curve  codes  in  polynomial  time). 

The  relationship  with  the  linear  coding  theory,  indicated  in  §5,  extends  further.  Ac¬ 
cording  to  §5,  with  every  multiplication  algorithm  one  can  associate  an  appropriate  code, 
and  low  minimal  complexity  algorithms  give  rise  to  very  good  codes.  Our  algorithms  (The¬ 
orem  6.2  and  others  [45-46])  lead  to  Goppa-like  linear  codes  arising  from  algebraic  curves 
over  finite  fields  [48]. 

New  algorithms  for  polynomial  multiplication  (convolution)  over  finite  fields  are  to  be 
extended  to  algorithms  over  Z[l/2]  or  Z  to  be  particularly  useful  in  applications  to  bignum 
computations.  Here  we  encounter  a  variety  of  diophantine  problems  connected  with  the 
number  of  solutions  of  classical  diophantine  equations,  see  [49-50]. 

Apparently,  one  of  the  conditions  that  guarantees  the  existence  of  fast  polynomial 
multiplications  over  Z,  is  the  existence  of  algebraic  number  fields  K,  [K  :  Q]  =  n  with  a 
(large)  number  M  of  “exceptional  units”.  We  say  that  61,. . .  ,6m  are  “exceptional  units” 
in  if  if  f  1  =  1  and  e,  -  cyis  a  unit  in  K  for  1  ±  j.  This  concept  first  appears  in  works  of 
Minkowski  and  Hurwitz.  Recently  it  was  studied  by  Lenstra.  In  particular,  Lenstra  had 
shown  that  the  following  explicit  bound 

m>£(±)’\Dk\K 

(where  Dk  is  the  discriminate  of  K  and  a  is  the  number  of  complex  archimedean  primes) 
on  the  number  M  of  exceptional  units  implies  the  Euclidianity  of  K  with  respect  to  the 
norm  of  K. 

Obvious  bound  on  M  for  an  arbitrary  field  K  of  degree  N  is: 

2  <  M  <  2n. 

On  the  other  hand,  by  looking  at  subfields  of  cyclotomic  fields  and  subfields  of  Abelian 
extensions  of  quadratic  imaginary  fields,  one  can  construct  infinite  families  of  fields  K  of 
degree  n  such  that 

Af  =  0(n(log  log  n)2) 

for  the  number  M  of  exceptional  units  in  K. 

Another  similar  object,  also  connected  with  Euclidianity,  is  useful  in  the  construction 
of  fast  polynomial  multiplication  algorithms  over  ring  of  constants  Z[l/2].  In  this  case, 
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one  considers  a  sequence  of  elements  :  t  =  1,  •  •  • ,  M  in  K  such  that  6i  —  6}-  are  (nonzero) 
divisors  of  powers  of  2.  Such  sequences  and  corresponding  polynomials  were  introduced 
by  H.  Cohn  [51]  as  “dyadotropic”. 

Another  interesting  arithmetic  problem  arising  from  the  construction  of  fast  algo¬ 
rithms  is  the  existence  of  long  sequences  of  "relatively  prime  over  Z*  polynomials.  In  this 
problem,  one  is  looking  at  a  sequence  (Qa(x)  :  a  6  A)  of  polynomials  from  Z[x]  (with,  say, 
leading  coefficient  1)  such  that  for  a  ^  0, 

res(Qa,Q0)  =  1 

(i.e  BQa  +  CQ0  =  1  for  B,  C  €  Z[x]).  One  can  understand  these  sequences  better  by 
looking  at  reductions  modp  (particularly  for  p  =  2)  and  comparing  lists  of  irreducible 
polynomials  over  Fp[xj.  Ideal  sequences  should  be  long  with  low  degrees  of  Qa.  As  with 
Euclidian  fields,  the  best  source  of  construction  of  such  sequences  is  by  means  of  division 
polynomials,  particularly  for  elliptic  curves  with  complex  multiplication.  The  correspond¬ 
ing  elliptic  divisibility  polynomials  for  an  elliptic  curve  E/Q  with  complex  multiplication 
in  an  imaginary  quadratic  field  L  (with  the  class  number,  say,  one)  are  given  by: 

U  $(u)NormM 

where  n  €  O  (O  is  an  order  in  L),  and  6(u)  is  a  normalized  a  -  function  (corresponding 
to  the  lattice  A  =  CIO  in  C: 

0(u)  =  <r(u)e“*^A^u,/,2, 

s2(A)  =  lira  ^2  w-2|wj'2*. 

<-*o+ 

w€A.w^0 

For  moderate  sizes  of  A,  we  had  constructed  sequences  of  relatively  prime  polynomials 
over  Z  explicitly. 

Our  new  algorithms  of  polynomial  multiplication  over  Z  for  small  n  (n  <  100)  are 
matching  in  minimal  complexity  algorithms  over  F2.  For  arbitrary  n,  we  can  improve 
known  upper  bound  of  the  minimal  complexity,  p.z{nyn)  of  multiplication  of  polynomials 
of  degree  n  —  1  with  the  ring  Z  of  constants  by  a  factor  log2  log  log  n. 

Development  of  entirely  new  classes  of  minimal  complexity  algorithms  is  closely  con¬ 
nected  with  the  study  of  arithmetic  surfaces  (a  la  Arakelov-Faltings).  The  development 
of  our  interpolation  methods  on  algebraic  manifolds  strongly  suggests  a  conjecture,  that 
all  polynomial  multiplication  algorithms  are  reducible  to  combinations  of  valuation  al¬ 
gorithms.  These  valuation  algorithms  correspond  to  archimedean  and  nonarchimedean 
valuations  on  appropriate  algebraic  surfaces.  Classical  algorithms  (including  high-school 
method,  ordinary  interpolation  algorithms,  residue  number  systems,  FFT  and  number- 
theoretic  FFT)  all  correspond  to  projective  spaces  kPn.  An  open  problem  on  whether 
Hz{n,n)  =  O(n)  is  equivalent  in  this  context  to  special  properties  of  L-functions  of 
Abelian  varieties  at  s  =  1. 

Let  us  look  now  at  the  total  algebraic  complexity  of  polynomial  multiplication  al¬ 
gorithms.  First  of  all,  there  are  yet  no  satisfactory  definitions  of  total  (multiplicative, 
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additive,  scalar,  etc.)  complexity,  corresponding  to  the  true  hardware  organization.  Inter¬ 
esting  definitions  and  results  belong  to  V.  Pan.  In  line  with  his  definitions,  we  can  give 
some  lower  end  bounds  on  the  total  algebraic  complexities  of  polynomial  multiplication. 
For  this  we  consider  an  arbitrary  (commutative)  scheme  having  (i)  n  nonscalar  (essential) 
multiplications;  (ii)  a  chain  additions,  and  (iii)  a  scalar  multiplications.  We  have  to  assign 
a  weight  to  each  scalar  multiplication.  If  c  is  a  scalar  which  is  an  algebraic  number,  its 
size  M[c)  (or  height)  is  usually  determined  as 

M(c)  =  JJmax(l,||c||„), 

V 

where  v  runs  through  all  valuations  of  the  smallest  field  K  containing  c.  E.g.  for  a  rational 
c  =  p/q  7^  0  with  (p,  q)  =  1,  M(c)  =  max(|p|,  \q\). 

It  is  not  unreasonable  to  attach  to  the  scalar  multiplication  by  c  a  weight  log  M(c) 
(this  is  an  understatement  of  a  hardware  performance). 

The  total  "cost”  of  the  computational  scheme  can  be  represented  as 

C  =  n  +  a  +  ^  log  M(c), 

C 

were  p  and  a  are  the  numbers  of  multiplication  and  addition  chains,  and  c  runs  through 
all  scalar  multiplication  chains. 

A  lower  bound  on  C  defined  this  way  in  the  problem  of  multiplication  of  two  polyno¬ 
mials  of  degree  n  —  1  is  non-linear  in  n: 

C  >  n  Iog2  n. 

On  the  other  hand,  in  our  new  low  multiplicative  complexity  algorithm  presented  here, 
the  "total  cost”  C  is  always 

C  =  0(n  log  n), 

which  makes  these  algorithms  as  attractive  in  applications  as  any  of  Fast  Fourier  methods. 

To  decrease  the  additive  complexity  (and  cost  of  scalar  multiplications)  one  has  to  ex¬ 
amine  the  choice  of  evaluations  or  interpolation  points  (choices  of  valuations,  archimedian 
and  nonarchimedian) .  These  choices  are  determined  by  the  hardware  realization,  were 
the  binary  structure  of  numbers  plays  a  crucial  role,  e.g.  multiplications  by  powers  of  2 
amount  to  simple  shifts.  This  problem  appeared  initially  in  polynomial  interpolation  and 
is  described  in  [52],  see  also  [53].  Explicitly  we  can  ask  of  the  cost  of  evaluation  of  n-th 
degree  polynomial  at  n  +  1  points,  (x, }  (over  an  appropriate  ring  of  scalars,  typically  a 
field).  Following  [52],  evaluation  of  a  set  of  points  X*  is  called  "fast”,  if  it  can  be  accom¬ 
plished  in  0(n log  n)  operations.  The  FFT  algorithms  provide  with  sets  of  points,  where 
fast  evaluation  can  be  accomplished.  These  points  are  roots  of  unity  of  a  given  order, 
which  is  highly  composite,  typically  a  power  of  2.  In  any  ring,  were  such  roots  of  unity 
exist  (and  the  order  of  roots  can  be  inverted),  fast  evaluation  is  possible.  It  is  this  fast 
evaluation  by  means  of  FFT  that  gives  rise  to  fast  convolution  and  then  fast  interpolation 
algorithms  based  on  fast  convolutions. 
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We  developed  recently  a  variety  of  new  fast  algorithms  of  rational  function  interpo¬ 
lation  and  evaluation  the  are  not  constrained  by  restrictions  of  (number-theoretic)  FFT 
relating  the  word  size  and  the  length  of  the  transformation. 

To  describe  our  algorithms  we  look  now  at  a  general  rational  function  interpolation 
problem.  In  the  partial  fraction  representation  of  rational  functions  one  looks  at  the 
rational  function  with  poles  only  at  given  (distinct)  points:  a x,...,an.  The  general  form 
of  such  a  rational  function  is 


(6.3) 


=  E 


x< 


z  —  on 


Thus  its  divisor  is  (—  ai  +  °°)*  The  evaluation  problem  for  this  function  (or  its 
divisor),  consists  of  simultaneous  determination  of  n  values  of  P(z)  at  z  —  (distinct 

from  ai): 

n 

(6.4)  yj  =  R{z)\z=0,  :i  = 

One  can  consider  the  transformation  from  T\  in  (6.3)  to  y,  in  (6.4)  as  a  discrete  (rational) 
transform  determined  by  two  divisors  or  A  -  (]C)=00  a>)  an<*  ^  =  (£^=oo  ^|)-  ^  *s  easy 
to  see  that  the  inverse  to  this  transformation  is  the  following  explicit  one: 


(6.5) 


H  = 


y  PB(ai)  PA(0})  yy 

jz f  p'aM  pW>Y  ' 


i  =  Here,  P*(z),  P»(x)  are  polynomials  of  degree  n  having  as  roots  {a^},  and 

{0i},  respectively. 

Into  the  scheme  (6.3-5)  fall  discretizations  of  important  one-  and  multi-dimensional 
integral  transform  (with  a  variety  of  quadrature  approximations  methods).  Among  singu¬ 
lar  integral  transformations  that  can  be  described  by  direct  and  inverse  schemes  (6.4-5) 
the  most  obvious  is  the  Hilbert  transform  on  a  circle,  its  proper  discretization,  and  com¬ 
putations  via  FFTs  are  described  in  detail  in  [55].  A  finite  Hilbert  transform  corresponds 
in  the  scheme  (6.4-5)  to  c*i  being  N-th  roots  of  1,  and  (3j  being  N-th  roots  of  -1. 

We  present  now  several  classes  of  fast  rational  evaluation  algorithms  with  comput?- 
tional  cost  0(n  log  n),  that  generalize  finite  Hilbert  transforms.  These  new  transformations 
correspond  to  a  variety  of  singular  integrals  taken  over  one-dimensional  complex  continuum 
and  to  singular  integrals  over  fractal  sets,  and  to  singular  integrals  with  elliptic  function 
kernels.  The  latter  transformations  have  interesting  number  theoretic  and  modular  inter¬ 
pretation  and  we  refer  to  them  as  Fast  Elliptic  Number  Theoretic  Transforms  (FENTT). 

To  describe  all  transformations  of  this  class  that  can  be  evaluated  in  0(n  log  n)  oper¬ 
ations,  we  look  at  ai  and  0}-  given  as  roots  of  polynomials  P*(z)  and  Pb{z),  respectively, 
where  polynomials  Pa  and  Pg  correspond  to  iteration  of  (fixed)  polynomials  and  rational 
functions.  Thus  we  start  with  a  sequence  of  degrees  (radices)  D i, ...,  Dm  and  with  rational 
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functions  (polynomials)  Ri(z),...tRm{z)  of  degrees  respectively.  The  polyno¬ 

mials  Pa  and  PB  have  degrees  n  =  Di...Dm  and  have  roots  as  preimages  of  two  distinct 
points  a  and  /?  under  iterated  mappings 

2»  +  l  =  Ri(Zi). 

Thus  Pa{z)  is  defined  as  (the  numerator  of)  Rm(Rm-i(...(Ri(z))...))  =  a,  and  Ps(z) 
as  (the  numerator  of)  Rm(RTn-l(...(Rl(z))...))  =  /?. 

The  most  interesting  case  is  that  of  Di  =  ...  =  Dm  =  D  and  Ri  =  ...  =  Rm  =  R  being 
a  fixed  rational  function  of  degree  D.  The  fast  algorithm  to  compute  (6.4-5)  is  similar  in 
the  flow  diagram  to  the  mixed  radix  FFT  schemes. 

The  mixed  radix  FFT  algorithm  is  a  special  case  with  Ri(z)  =  zDi  :  *  =  l,...m.  The 
total  computational  co6t  of  such  mixed  degree  algorithm  depends  on  the  computational 
cost  of  multiplication  of  polynomials  of  degrees  0(D»).  In  case  of  D»  =  O(D),  and  of  large 
m,  the  total  computational  cost  of  evaluation  of  (6.4-5)  is  0(mDi...Dm). 

Among  rational  transforms  that  fit  into  this  scheme  we  can  mention:  a)  Hilbert- 
like  transforms  for  various  Julia  sets  corresponding  to  polynomial  or  rational  mappings 
z  — »  /Z(-s);  b)  singular  integral  transformations  with  elliptic  function  kernels.  In  the  case 
a),  contours  of  integration  can  have  arbitrary  fractional  dimension.  In  the  case  b),  a 
continuous  analog  of  the  transformation  (6.5)  is  the  following 


y(s)=  r  X(t)p(t-s)dt 

Jo 


for  the  Weierstrass  elliptic  function  p(u).  The  latter  transform  and  its  discrete  versions 
are  particularly  well-suited  for  arithmetic  interpretations.  If  an  elliptic  curve  E  is  defined 
over  a  finite  field  k  =  Fp  (e.g.  is  a  reduction  mod  p  of  an  elliptic  curve  over  Q),  then  the 
set  of  its  k-rational  points  is  an  Abelian  group  of  order  Np  =  p  —  ap  +  1  for  |ap|  <  2 v/p. 
Moreover,  for  any  integral  a,  ja|  <  2 y/p,  there  is  an  elliptic  curve  over  Fp  with  Np  =  p— a-f  1. 
Whenever  2n  divides  Np,  one  has  points  of  order  2n  on  an  elliptic  curve,  all  defined 
over  Fp.  Consequently,  the  fast  evaluation  algorithm  can  be  applied  in  this  case  with 
D i  =  ...  =  Dm  =  4.  The  rational  function  R{z)  in  this  case  is  the  duplication  formula  for 
x-coordinate  in  the  Weierstrass  cubic  form  of  an  elliptic  curve  E:  y 2  =  4x 3  —  —  gy. 


R{x)  =  -2x  + 


(6x2  -  g2/2)2 
4y3 


The  choice  of  a  and  (3  should  be  of  x-coordinates  of  second  order  points  on  E.  When¬ 
ever  prime  p  is  such  that  p  is  within  distance  2 y/p  from  a  power  of  2  (or  from  a  highly 
composite  number)  one  has  a  very  fast  algorithm  of  rational  evaluation  and  transformation 
of  length  0(p)modp. 

The  same  method  can  be  used  for  a  composite  number  of  M  if  one  chooses  an  appro¬ 
priate  elliptic  curve  over  Z/MZ,  whose  reduction  mod  p  for  prime  factors  p  of  M  have 
highly  composite  Abelian  group  of  Fp-rational  points.  Using  the  standard  facts  of  the  dis¬ 
tribution  of  highly  composite  numbers,  see  [56],  we  conclude  that  with  any  number  M  we 
have  FENTT  of  length  O(M)  over  Z/MZ  with  computational  cost  0(M  log  M).  FENTT 
algorithms  are  particularly  attractive  in  parallel  implementation. 
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7.  Evaluation  of  Solutions  of 
Linear  Differential  Equations. 


Recently  we  have  developed  new  fast  algorithms  of  power  series  computations  for  im¬ 
portant  classes  of  algebraic  functions  and  solutions  of  linear  differential  equations  with 
rational  function  or  algebraic  coefficients,  [32),  [36],  [27].  These  algorithms  provide  also 
with  efficient  methods  of  evaluation  and  of  analytic  continuation  of  solutions  of  these  equa¬ 
tions.  Our  algorithms  are  all  based  on  the  analysis  of  (linear)  recurrences  on  coefficients 
of  power  series  expansions. 

We  differentiate  between  the  operational  and  the  total  (boolean)  complexity.  By 
operational  complexity  of  an  algorithm  one  understands  the  number  of  primitive  operations 
(most  notably  additions  and  multiplications),  independent  of  the  sizes  of  numbers  involved, 
needed  to  complete  this  algorithm.  By  the  total  (or  boolean)  complexity  we  understand 
the  total  number  of  primitive  operations  (on  short  or  single-bit  data)  needed  to  complete 
a  given  program.  The  main  distinction  between  the  conversion  from  the  operational  to 
the  total  complexity,  depending  on  the  size  of  numbers  involved,  is  described  by  the  total 
complexity  of  multiplication  of  big  numbers. 

Let  us  denote  by  M(n)  the  total  complexity  of  multiplication  of  two  n-bit  integers. 
Then  the  best  known  upper  bound  on  M(n)  belongs  to  Schonhage-Strassen  [57]: 

M(n)  =  0(n  log  n  log  log  n). 

In  comparison,  a  total  complexity  of  addition  is  relatively  simple:  it  is  only  O(n). 

All  algebraic  operations  on  bigfloats  have  boolean  complexity  of  the  same  order  of 
magnitude  as  a  multiplication.  For  example,  if  B(n)  denotes  one  of  the  following  total 
complexities:  division  of  n-bit  bigfloat  numbers,  square  root  extraction,  or  raising  to  the 
fixed  (rational)  power,  then  B[n)  =  0(M(n)),  and  M(n)  =  0(B(n)). 

Our  methods  based  on  fast  computation  of  solutions  of  matrix  linear  recurrences 
allow  us  to  construct  new  algorithms  of  low  total  complexity  for  evaluation  of  solutions 
of  linear  differential  equations  with  an  arbitrary  precision.  Such  algorithms  of  low  total 
complexity  were  till  now  unknown  for  general  special  functions.  Low  complexity  algorithms 
were  known  only  for  algebraic  function  computations  (standard  Newton  method),  and  for 
elementary  functions. 

We  describe  now  our  new  algorithms  [27,  36]  of  low  total  complexity  for  computation 
of  values  of  solutions  of  arbitrary  differential  equations,  neither  related  to  any  rapidly  con¬ 
vergent  analytic  transformations,  nor  to  any  low  operational  complexity  methods.  These 
methods  differ  in  boolean  complexity  at  worst  by  factor  log2  n  or  log3  n  from  the  boolean 
complexity  Af(n)  of  algebraic  computations. 

Let  us  look  at  an  arbitrary  linear  differential  equation  with  rational  (polynomial) 
coefficients,  either  in  the  scalar  form 

any(n)  +  an-iy(n_1)  +  *  •  *  +  aiy'  +  aoy  =  0,  (7.1) 
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or,  in  the  general  matrix  form, 


(7.2) 

where  a*  6  C(x),  and  A{x )  €  Mn(C(x)).  We  are  interested  in  the  evaluation  of  solutions 
of  (7.1)  and  (7.2)  with  arbitrary  precision  using  the  method  of  (formal)  power  series  ex¬ 
pansions  of  [36].  Having  specified  initial  conditions  for  y(z)  or  Y(x )  at  x  =  x0,  we  want  to 
evaluate  within  a  given  precision  l  y(x)  or  Y(x)  at  another  point  x  =  xi.  For  all  practical 
purposes  we  assume  that  values  of  xo  and  xi  are  given  correctly  with  the  precision  of  l 
bits,  or  as  rational  or  algebraic  numbers  of  sizes  less  than  l.  To  determine  values  at  x  =  Xi 
from  those  at  x  =  xo  one  has  to  specify  a  path  7  from  xo  to  xi  on  the  Riemann  surface 
(or  its  universal  covering)  of  y(x)  or  Y{x),  see  [27], 

The  global  recipes  [27,  36]  for  the  analytic  continuation  of  Y  (x)  along  7  are  combined 
with  better  local  methods  of  evaluation  of  Y(x)  from  x  =  Xo  to  x  =  ii.  For  this  we  are 
using  local  power  series  of  Y(x)  to  continue  Y(x)  from  x  =  xq  to  x  =  Xi  making  several 
steps  between  x0  and  Xi,  releasing  consecutive  blocks  of  X\  in  bursts.  We  call  this  method 
“bit-burst”  method.  This  way  we  arrive  at  the  following  general  theorem  that  gives  an 
upper  bound  on  the  total  (boolean)  complexity: 

Theorem  7.1.  Let  (7.2)  be  a  given  linear  differential  equation  with  rational  function 
coefficients,  and  Y{x)  be  its  arbitrary  (regular)  solution  with  initial  conditions  at  x  =  x,n, 
where  Xin  is  an  Ff-bit  number.  Given  a  path  7  from  x,„  to  an  Ff-bit  number  ifin  (on  the 
Riemann  surface  of  F(x))  of  length  L,  one  can  evaluate  Y(xyx-Xfi%  at  x  =  x/in  with  the 
full  Ff-bit  precision  at  most 


0(M(K)(log3K  +  logL)) 


bit  operations. 

The  bit-burst  method  in  the  general  form  of  Theorem  7.1  is  not  the  best  possible: 
one  would  like  to  see  log3  K  replaced  by  log  K  always.  Sometimes  the  complexity  can  be 
lowered: 

I.  If  Xin  and  Xfin  are  fixed  rational  numbers,  then  the  computation  of  Y[x)  with  the 
full  K  bits  of  precision  has  total  (bit)  complexity  at  most  0(M(K)(\og2  K  +  logL)). 

II.  If  the  differential  equation  (7.1)  or  (7.2)  possesses  special  arithmetic  properties, 
bit-complexity  can  be  lowered.  E  g.  if  the  (7.1)-(7.2)  possesses  a  solution  which  is  either 
an  .E-function  or  a  G-function,  then  the  general  bit  bound  of  Theorem  7.1  can  be  lowered 
to 

0(.\f(K)  •  (log2  K  -I-  log  L)). 

If,  further,  like  in  xtn  and  x/in  are  fixed  rational  numbers,  and  Y{ x)  is  built  from  E-  or 
G-functions,  then  K  significant  digits  of  K(xo)  can  be  computed  in 

0(.\f(K)  ■  (log  K  +  log  L)) 


bit  operations. 
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This  last  bound  is  unsurpassed  by  any  other  algorithms  even  for  elementary  functions, 
like  the  exponent,  where  low  operational  complexity  algorithms  are  well  known,  see  [58]. 

Our  original  interest  in  the  development  of  power  series  evaluation  facilities  was  purely 
transcendental.  We  wanted  to  have  the  ability  to  compute  the  monodromy  group  of  linear 
differential  equations  with  an  arbitrary  precision  to  check  various  hypotheses  of  tran¬ 
scendence,  algebraicity  and  the  existence  of  algebraic  relations  among  elements  of  the 
monodromy  matrices.  The  “constants”  appearing  as  elements  of  monodromy  matrices  en¬ 
compass  classical  constants  of  geometry  and  analysis.  Among  them  there  are  periods  of 
algebraic  varieties  including  values  of  the  Euler  T-  and  B-functions  (at  rational  points)  and 
other  integrals  of  elementary  and  algebraic  functions  over  closed  paths. 

To  compute  the  monodromy  group  of  linear  differential  equations  we  use  the  power  se¬ 
ries  method  based  on  the  direct  analytic  continuation  of  a  fundamental  system  of  solutions. 
In  this  method  one  starts  with  a  fundamental  system  of  solutions 

y  =  (yi(*),---,yn(z)) 

of  (7.1)  [given  by  their  initial  conditions]  and  analytically  continues  it  along  a  closed  path 
with  no  singularities  on  it.  The  system  Y  analytically  continued  along  a  path  7  undergoes 
a  linear  transformation 

y~7y-M(7). 

The  set  of  all  matrices  M(~t)  is  a  monodromy  group  of  (7.1). 

Areas  of  applications  include: 

1)  multiple  precision  computation  of  Abelian  integrals  and  their  periods.  These  com¬ 
putations  are  used  then  in  the  transcendental  solution  of  the  problem  of  reduction  of 
Abelian  integrals.  In  order  to  determine  the  reducibility  of  Abelian  integrals  to  the  lower 
genra  (e.g.  when  an  Abelian  integral  is  an  elementary  function)  one  looks  at  the  Z-relations 
between  periods. 

2)  Another  application  of  monodromy  computations  is  to  the  solution  of  the  direct 
and  inverse  Galois  problem,  when  one  wants  to  find  a  Galois  group  of  a  given  algebraic 
function  field  (differential  equation)  or  wants  to  construct  a  field  with  a  given  Galois 
group.  Our  package  is  designed  mainly  for  the  direct  Galois  problem,  but  we  found  it 
extremely  convenient  to  use  for  solution  of  the  inverse  Galois  problem  when  the  number  of 
parameters  is  not  large  and  one  can  numerically  invert  the  function  (monodromy  matrix 
output)  generated  by  our  program. 

3)  Uniformization  theory  seems  to  be  an  attractive  proving  ground  for  application  of 
monodromy  packages.  For  us  the  crucial  problem  was  the  question  of  arithmetic  nature 
of  parameters  in  the  solution  to  the  uniformization  problem,  including:  a)  algebraic  equa¬ 
tions  (i.e.  their  coefficients)  defining  Riemann  surfaces  to  be  uniformized;  b)  invariants  of 
discrete  groups  that  uniformize  these  Riemann  surfaces  (Fricke  parameters  and  more  so¬ 
phisticated  parametrizations  of  Teichmuller  spaces);  and  finally,  the  most  notorious  group 
of  parameters  c)  assessory  parameter  that  uniquely  determine  the  differential  equation  of 
the  second  order,  ratio  of  solutions  of  which  determines  the  inverse  to  the  uniformizing 
function. 
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Interesting  applications  of  our  numerical  studies  of  uniformization  theory  include  mas¬ 
sive  computations  of  accessory  parameters  (and  corresponding  Fuchsian  groups)  for  hyper- 
elliptic  surfaces.  One  of  the  results  of  the  computations  [32]  was  the  negative  solution  to  the 
Whittaker  conjecture  (1928),  which  predicted  the  explicit  expression  of  accessory  parame¬ 
ters  for  hyperelliptic  surfaces  of  genus  g  >  2,  by  looking  at  their  birational  transformations. 
The  true  structure  of  Teichmuller  spaces,  as  revealed  by  our  numerical  computations  is 
quite  complex  [27],  [32]. 

4)  Another  group  of  applications  of  explicit  solutions  of  accessory  parameter  prob¬ 
lem  involves  explicit  determination  of  conformal  mappings  of  complicated  domains  using 
appropriate  linear  differential  equations.  This  gives  a  high-precision  method  that  can  be 
used  in  adaptive  grid  computations. 

Complexities  of  power  series  computations  can  be  also  investigated  from  the  point 
of  view  of  parallel  (vector)  implementation.  The  parallel  (vector)  methods  are  important 
because  they  seem  to  be  the  only  way  to  address  large  jobs.  Algorithms  that  compute  power 
series  coefficients  and  values  of  functions  by  means  of  linear  recurrences  are  particularly 
well-suited  for  various  vector  and  parallel  implementations. 

The  methods  that  we  propose,  bit-burst  algorithms  for  computations  of  arbitrary 
linear  differential  equations,  have  always  depth  O(lognloglogn)  even  though,  in  general, 
the  total  bit  operation  count  is  0(M(n)  •  log3  n).  In  fact,  for  E-  and  G-functions  the  total 
bit  operation  count  with  the  same  depth  is  only  0(M(n)  •  log2n). 


8.  Polynomial  Root  Finding 


One  calls  a  (univariant)  polynomial  sparse,  if  it  has  a  number  of  nonzero  coefficients 
significantly  less  than  its  degree. 

Sparse  polynomials,  particularly  trinomials  and  quadrinomials,  are  extremely  inter¬ 
esting  from  many  points  of  view.  Let  us  mention  now  just  one  problem:  which  Galois 
groups  G  can  be  realized  as  Galois  groups  of  trinomials  axn  +  bxm  +  c  =  0  (for  a,  b,  c  6  Z 
or  for  a,  6,  c  6  Z[x])? 

The  generic  Galois  group  of  a  trinomial  of  degree  n  is  Sn,  so  roots  of  a  trinomial 
cannot  be  determined  in  closed  form  for  n  >  5.  Nevertheless  one  can  express  the  roots  of 
trinomials  in  a  closed  form  as  infinite  series  with  binomial  coefficients.  Such  an  expression 
can  be  found  in  Ramanujan’s  writing. 

Ramanujan  normalizes  the  trinomial  equation  in  the  following  form 

aqxp  +  xq  =  1,  a  >  0  and  0  <  q  <  p. 

For  n  >  0  and  a  particular  root  of  a  trinomial  aqxp  +  xq  =  1,  Ramanujan  [Rl,Ch.3  of 
his  Second  Notebook]  proves  the  expansion: 
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(8.1) 


This  expansion  converges  when  M  <  p~p,q  ■  (p-q)(p~q'"}. 

This  expansion  of  Ramanujan  is  usually  attributed  to  Lambert  (1758).  The  usual 
derivation  of  this  formula  is  based  on  Lagrange  inversion  theorem  (1770).  References  to 
this  and  other  similar  Ramanujan’s  formulas  can  be  found  in  Brendt,  Evans,  Wilson,  [59]. 

“Ramanujan’s”  formula  can  be  also  derived  directly  from  the  trinomial  equation  using 
the  transformation  of  contour  integrals  into  Barn’s  type  integrals  of  T-functions.  This 
derivation  had  been  achieved  by  Hj.  Mellin  [60],  and  generalized  by  him  to  arbitrary 
algebraic  equations. 

These  integral  representations  are  particularly  convenient  for  sparse  polynomials, 
where  they  also  can  be  converted  into  multivariate  power  series  expansions. 

Let  us  look  at  an  arbitrary  k  +  2-nomial 


xn  +  aixnx  +  a2Xnt  +  ...  +  dkin~k  -1  =  0, 


(8.2) 


n  >  rii  >  njt  >  . . .  >  nfc  >  0.  Then  we  have  the  following  expansion  of  all  n  roots  xy 
of  this  equation,  or  of  their  n  -  th.  powers  : 
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The  series  (8.3)  converge  always  whenever 


max(|aj|  :  i  =  1,. . . ,  k)  <  min(n  /  {kyj[n\nl l(n  -  ni)r‘_ni),  n/(k\/[n]n^k  (n  -  rik)n  n*)). 

In  principle,  the  power  series  formulas  like  Ramanujan  (8.1)  or  Mellin’s  (8.3)  can  be 
used  directly  for  the  root  finding  of  sparse  polynomials.  It  is  much  more  practical,  though 
to  use  our  methods  of  analytic  continuation  of  power  series  solutions  of  linear  differential 
equations  satisfied  by  algebraic  functions.  These  differential  equations  can  be  represented 
in  terms  of  equations  on  multidimensional  generalized  hypergeometric  functions.  These 
equations  can  be  integrated  and  analytically  continued  everywhere.  The  most  general 
expression  for  a  system  of  Fuchsian  linear  differential  equations  on  xj*,  as  functions  of 
ai>  •  •  •  i  afc,  can  be  deduced  from  the  power  series  representation  of  (8.3)  for  Xy . 

Namely,  the  (algebraic)  functions  Xy  for  roots  xy  of  (8.2)  satisfy  the  following  system 
of  k  Fuchsian  linear  differential  equations  on  d/dai,..  .,djdak- 

{(-i)"’  •  n"  •  -  n m =0  (»i«i  +  •  •  •  +  +H  +  nm) 


*  nm=0  (»#X«1^  + 


+  n'kak-z - H  +  nm)}x(f  =  0;  n[  =  n  —  n*  :  »  =  1, . . . , k.  (8.4) 

dak  3 


This  system  (8.4)  of  linear  differential  equations  can  be  integrated  starting  from  ax  = 
a2  =  . . .  =  ak  —  0  with  initial  conditions  xy  =  Cy  for  n  —  th  roots  of  unity  e™  =  1  :  j  — 
1, . . .  ,n.  Alternatively  integration  can  start  at  oo,  or  at  any  other  point  in  k-  space,  where 
xy  are  known. 

The  methods  of  analytic  continuation  of  algebraic  functions  xy  from  a<  =  0  can  be 
easily  implemented,  and  its  only  nontrivial  part  concern  the  regular  (Puiseux)  power  series 
expansions  of  xy  in  the  neighborhood  of  singularities  of  differential  resolvent  equations 
(8.4),  see  r6l].  This  leads  to  an  iterative  algorithm,  where  to  compute  the  roots  of  k  +  2 
-nomial  one  has  to  precompute  the  roots  of  k  +  1-nomials  (that  give  the  singularities 
of  the  former  branches  of  algebraic  functions),  etc.  This  method  is  perfect  for  parallel 
implementation,  since  on  each  level  of  iteration  all  roots  are  computed  independently. 
The  storage  requirements  are  determined  only  by  the  number  of  roots  one  wants  to  see. 
Moreover,  this  iterative  method  was  implemented  in  vector  and  array  hardware,  e.g.  on 
IBM  3090- VF  or  on  CRAYs,  because  most  of  the  operations  are  vectorizable  loops. 

The  complexity  bound  for  the  computation  of  roots  of  sparse  polynomials  using  the 
analytic  continuation  of  solutions  of  differential  resolvents  is  the  following. 

Theorem  8.1.  One  can  compute  all  n  roots  of  a  fc-nomial  of  degree  n  with  the 
precision  M  (of  leading  digits)  in  0(k2log2M )  parallel  steps  on  n  processors.  If  k  C  n, 
then  one  can  compute  m(<  n)  roots  of  a  fc-nomial  of  degree  n  with  the  precision  of  M 
(digits)  in  0(k  log2M)  parallel  steps  on  O(m)  processors. 

The  crucial  osbtacle  in  the  complex  root  finding  for  large  degrees  of  polynomials  is 
the  need  for  multiple  precision  computations.  Unfortunately  high  precision  requirements 
make  the  programming  of  the  root  finding  methods  awkward  in  any  vector  or  parallel 
environment.  Because  of  these  constraints  our  fast  polynomial  root  finding  algorithms 
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for  dense  polynomials  involved  degrees  only  in  thousands.  The  largest  “random”  dense 
polynomial  we  completely  solved  in  double  precision  on  a  single  processor  of  CRAY  II  had 
degree  15,000. 

The  target  of  our  large  degree  polynomial  root  finding  programs  was  the  analysis  of  the 
distribution  of  complex  roots  of  a  real  (or  complex)  polynomial  with  random  coefficients. 
Specifically  we  looked  at  normally  distributed  random  coefficients,  though  various  other 
distributions  were  analyzed  as  well.  The  distribution  of  real  roots  of  a  random  polynomial 
of  degree  n  (there  are  0(logn)  of  them)  was  described  by  Hardy-Littlewood-Kac.  Much  less 
is  known  about  the  distribution  of  the  complex  roots  in  addition  to  the  obvious  statement 
that  “most  of  the  roots  are  uniformly  distributed  around  the  unit  circle.” 

We  have  conducted  extensive  computations  of  complex  roots  for  large  series  of  random 
polynomials  (normal,  uniform,  uniform  {0,1},  and  other  distribution  of  coefficients)  of 
degrees  varying  from  500  to  15,000.  While  degrees  up  to  1,000  are  easy  to  handle  on  a 
PCAT  with  an  accelerator  board  (in  our  case  it  was  DSI-020),  degrees  higher  than  1,000 
required  a  vector  facility.  We  used  the  vector  facility  of  IBM  3090-VF  for  degrees  up  to 
7,000,  and  CRAY  II,  with  time  provided  by  the  NSF  Grant,  for  degrees  7,000-10,000  with 
the  maximum  of  15,000.  The  limit  on  degrees  is  a  consequence  of  precision  constraints, 
and  an  increase  in  precision  significantly  inhibits  the  vectorization  and  slows  down  the 
computations.  We  present  some  of  the  outputs  of  the  pictures  of  complex  roots  of  random 
polynomials  with  normal  distribution  of  coefficients,  with  parts  of  the  picture  blown  up 
for  detail. 

These  computations  were  based  on  our  new  parallel  algorithms  of  root  finding  for 
dense  polynomials. 

The  crucial  problem  in  the  construction  of  the  truly  polynomial  time  root  finding 
methods  is  the  ability  to  deal  with  possible  clustering  of  roots. 

We  propose  a  new  (probabilistic)  method  based  on  the  analytic  continuation  of  al¬ 
gebraic  functions  through  their  singular  points.  This  method  uses  in  an  indirect  way  the 
homotopy  method,  the  Euler-Newton  method  and  the  Lobachevsky  method.  Our  method 
is  based  on  the  deformation  of  a  polynomial  P(x )  €  Z[x)  degP  =  n,  into  a  bundle  P(x,t ) 
with  the  initial  position  P(x,0)  =  Pq{x)  for  a  random  polynomial  P0(x),  and  with  the 
final  position  P(x,  1)  =  P(x).  We  then  study  the  algebraic  functions  ij  =  Xi(t)  that  are 
branches  of  an  algebraic  function  defined  by  the  equation 

P  (x,  0  =  0. 

Theorem  8.2.  If  P(x)  6  Z|x!  has  degree  n,  then  all  n  zeroes  of  P(x)  can  be  found 
with  the  precision  M  of  leading  digits  ( M  »  1)  in  0[M logn)°^  parallel  steps  on  n0^1) 
processors.  For  a  generic  polynomial  P(z)  of  degree  n,  all  n  complex  zeroes  of  P(x)  can 
be  found  with  the  precision  of  M  leading  digits  (Af  >  1)  in  0{log  M  log  n)°^  parallel 
steps  on  n0!1)  processors. 
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Plot  Shows  Complex  Zeros  of  the  Polynomial 
The  Degree  of  the  Polynomial  Is  7000 

This  is  a  random  polynomial.  Seed  is  -1 

This  is  a  Whole  Root  Picture 


v  axis 


Plot  Shows  Complex  Zeros  of  the  Polynomial 


The  Degree  of  the  Polynomial  Is  7999 

This  is  a  random  polynomial.  Seed  is  -3 


This  is  a  Window  of  the  Whole  Root  Picture 


y  axis 
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Abstract.  Graph-theoretical  and  topological 
properties  of  large  interconnection  networks  are 
studied  for  the  purpose  of  construction  of  mas¬ 
sively  parallel  computers.  The  main  goals  are 
the  determinantion  of  networks  of  a  given  size 
(number  of  nodes  and  links)  having  the  smallest 
transmittion  delay,  or  having  the  fastest  routing 
of  data  between  processors  according  to  a  given 
permutation.  This  problem  is  analyzed  for 
regular  graphs  (having  large  automorphism 
groups),  and  especially  for  Cayley  graphs  of  clas¬ 
sical  finite  groups.  Tables  of  new,  record  (AJD)- 
graphs  (of  maximal  size  for  a  given  degree  A 
and  diameter  D)  are  presented  for  A  =  4.  These 
graphs  arise  from  groups  GL?(Fp).  The  relative 
merit  of  these  models  or  parallel  Computers  is  ex¬ 
amined.  The  deterministic  and  randomized  algo¬ 
rithms  of  data  routing  according  to  a  given  per¬ 
mutation  are  studied  for  regular  graphs  with 
results  close  to  the  best  possible.  The  practical 
realization  of  (AJ))-graphs  with  A  *  4  is  sug¬ 
gested  in  a  form  of  a  network  of 
TRANSPUTERS™. 

§0.  Background. 

Historically,  one  of  the  first  researchers  who 
singled  out  the  topological  (graph-theoretic)  aspect 
of  the  construction  of  optimal  large  interconnection 
networks  was  Elspas  [1].  Since  then  the  problem 
of  the  topological  and  graph-  theoretic  properties  of 
interconnection  networks  has  attracted  considerable 
attention  for  its  theoretical  merits  and  in  applica¬ 
tions  to  telecommunications  networks  and.  recently, 
in  construction  of  massively  parallel  processors 
and  VLSI.  We  are  interested  in  one  aspect  of  these 
studies:  how  to  design  a  network  so  that  transmis¬ 
sion  delays  are  as  small  as  possible,  uhile  each 
node  (a  station  or  a  processor)  is  connected  to  only 
a  few  other  nodes.  In  modern  graph-iheoreuc  Lan¬ 
guage  this  problem  is  known  as  the  (A.Di  problem, 
where  D  is  the  diameter  of  the  graph  (the  maxi¬ 
mal  number  of  links  used  to  transmit  any  single 
message),  and  A  is  the  degree  of  the  graph  (the 
maximal  number  of  links  incident  at  any  node).  In 
this  problem  the  (AJD) -graph  is  the  one  with  the 


maximal  size  of  a  degree  A  with  a  diameter  of  at 
most  D.  As  it  has  turned  out,  it  is  rather  difficult 
to  construct  explicitly  (AJD)-  graphs.  However,  not 
an  unexpected  result  of  the  study  [2]  shows  that 
"randonT  (i.e.  "almost  all")  regular  graphs  of  de¬ 
gree  A  £  3  have  the  diameter  D  asymptotically 
close  to  the  best  possible  Moore’s  bound  as  D  -> 
<«>.  The  phenomenon  that  random  graphs  seem  to 
have  quite  small  diameter  was  observed  experimen¬ 
tally  by  one  of  the  authors  of  this  paper  (D)  in  the 
course  of  construction  of  a  simulation  machine,  and 
became  the  starting  point  of  this  investigatio.  In 
fact,  random  graphs  seem  to  have  "good"  extremal 
properties  in  many  other  interconner'on  problems. 
These  include  graphs  with  a  large  6irth,  graphs 
giving  rise  to  concentrators  and  superconcentraiors, 
diffusers  and  expanders.  For  a  discussion  of  ran¬ 
dom  graphs,  their  definitions  and  properties,  see 
Bollobas  book  [3].  On  the  other  hand,  random 
regular  graphs  of  large  size  are  not  easy  to  con¬ 
struct  (see  attempts  in  [4]),  their  layout  and  routing 
tables  are  irregular  (this  is  both  their  advantage  and 
disadvantage,  depending  on  the  circumstances), 
and,  most  important,  random  graphs  are  not  the  op¬ 
timal  (AjD)-graphs.  (In  fact,  the  diameter  of  a  ran¬ 
dom  graph  of  a  given  degree  is  always  larger  than 
that  of  the  best  (AjD)-graph  of  a  given  size.) 

The  search  for  the  optimal  (AJD)-graphs  has 
been  conducted  for  some  time  in  an  organized 
fashion  with  a  table  regularly  updated  and 
published.  For  one  of  the  first  tables  see  [5].  In 
the  80’s  better  tables  followed,  [6],  [7],  Now  there 
is  a  regularly  updated  table  of  (A,D)-graphs 
published  by  Bermond  et  al.  [8],  and,  most  recently 
[9].  Most  of  the  best  (A,D)-graphs  from  these 
tables  are  constructed  from  a  few  families  of  spe¬ 
cial  regular  layout  graphs  by  means  of  various  com¬ 
position  operators.  Among  the  building  blocks  of 
these  constructions  the  most  famous  and  important 
is  the  de  Bruijn  family  of  graphs  that  were  redis¬ 
covered  many  times.  We  refer  to  [10]  for  historic 
descriptions  of  XlX-th  and  XX-th  century  dis¬ 
coveries.  De.  Bruijn  graphs  can  be  described  as 
graphs  of  r  vertices  represented  as  words  of 
length  D  £  3  in  r  2  2  letters.  Two  words  are  con¬ 
nected  if  the  last  D  -  1  letters  of  one  are  the  same 
as  the  first  D  •  1  letters  of  another.  The  diameter 


of  this  graph  is  D  and  its  (maximal)  degree  is  2 r. 
These  graphs  were  generalized  in  a  number  of  ways 
(see,  particularly,  [11],  where  the  si/e  of  a  graph 
can  be  any  number).  In  modem  computer  science, 
similar  families  of  graphs  were  introduced  in  [12] 
as  cube-connected  networks  (also  known  as  CCC: 
cube-connected  cycles)  in  connection  with  the  opit- 
mal  area-  time  complexity  VLSI  designs.  These 
networks  of  [12]  have  nodes  represented  by  pairs 
(ij),  where  i  is  mod  N  and  J  is  N-bit  word;  two 
vertices  (i  J)  and  (kj.)  are  connected  if  and  only  if 
either  J  =  L  and  i  -  k  =  1  (mod  N),  or  i  =  k  and  J 
and  L  differ  only  in  i-th  bit.  These  graphs  have 
degree  3,  and  their  routing  diagram  turns  out  to  be 
very  useful  for  the  layout  of  the  FFT  transforma¬ 
tions  in  VLSI  circuits  [12].  For  an  arbitrary  degree 
these  networks  were  generalized  in  [13]. 

We  have  conducted  our  own  search  for  (AJD)- 
graphs  with  particular  attention  paid  to  the  degree  A 
=  4.  The  four  links  at  every  node  corresponding  to 
this  case  have  a  hardware  realization  in  a  very 
popular  series  of  TRANSPUTER™  chips.  These 
chips  can  be  assembled  according  to  any  layout  of 
graphs  of  degree  4,  and  a  crucial  problem  of  the 
best  network  of  transputers  can  be  attacked  as  a 
(AJ))-problem  for  A  =  4.  In  our  experimental 
studies  it  has  turned  out  that  some  of  the  best  (A  J))- 
graphs  have  large  automorphism  groups  ("the  local 
view  from  every  mode  is  the  same"),  and  locally 
many  of  these  graphs  resemble  a  3-tree.  This  im¬ 
mediately  leads  to  two  observations:  a)  that  one 
has  to  consider  Cayley  graphs  or  graphs  repre¬ 
senting  the  action  of  a  finite  group  with  two  gener¬ 
ators;  b)  that  factors  of  the  free  group,  e.g.  factors 
of  the  modular  group  with  respect  to  arithmetic 
(congruence)  subgroups,  should  be  considered.  Fac¬ 
tors  of  modular  group,  particularly  SL2(Fp)  groups 
have  already  been  analyzed  in  connecuon*wiih  dif¬ 
ferent  extremality  problems  (Margulis,  study  of  su¬ 
perconcentrators  and  also  [14]).  Interestingly 
enough,  Caley  graphs  of  SL2CZ/NZ)  and  graphs  of 
action  of  SL2(Z/NZ)  on  P^Z/NZ),  as  well  as 
Cayley  graphs  corresponding  to  factors  of  quater¬ 
nion  Fuchsian  groups  modulo  congruence  sub¬ 
groups,  do  not  give  the  best  diameter  for  a  given 
size  and  degree  (=  4).  These  graphs,  however, 
often  provide  graphs  with  the  best  diffusion  and  ex¬ 
pansion  coefficients  [15],  [16].  (See  below  a  con¬ 
jecture  on  the  relation  between  the  diameter  and  the 
expansion  coefficient)  Rather,  Cayley  graphs  of 
factors  of  the  modular  groups  have  diameters 
similar  to  those  of  random  graphs  in  the  mid-range 
of  the  size  n  of  the  graph  (n  S  106),  but  have  a  dis¬ 
tinct  advantage  of  a  very  regular  layout  and  a 
simple  algorithm  for  generation  of  a  routing  table. 
The  best  (with  respect  to  diameter)  Cayley  graphs 
that  we  found  correspond  to  Borel  subgroups  of 
GL2(Fjj).  These  graphs  are  better  than  those 
known  oefore;  their  routing  properties  are  similar  to 
those  of  CCC.  In  particular,  various  important  per¬ 
mutations  can  be  realized  in  relatively  few  cycles 
on  these  graphs.  We  present  below  some  of  these 
graphs.  We  investigated  the  relative  merit  of  these 


and  other  graph  layouts  as  models  for 
TRANSPUTER™  based  and  other  parallel 
machines.  For  example,  a  crucial  problem  is  the 
routing  of  data  between  processors  according  to  a 
given  permutation.  Our  experiments  showed  that 
the  best  (A,D)  graphs  (looking  locally  like  a  A  -  1  - 
tree)  are  not  the  best  suited  for  this  problem.  From 
this  point  of  view  the  criterion  of  the  minimality  of 
D  with  a  given  A  is  not  necessarily  a  correct  one. 
On  the  other  hand,  the  regular  layout  of  Cayley  and 
other  similar  graphs  with  a  large  automorphism 
group  simplifies  and  accelerates  the  routing  of  data 
according  to  a  fixed  precomputed  (precompiled) 
permutation  (cf.  with  Benes  networks  or  CCC).  Un¬ 
fortunately,  fault-tolerant  studies  of  these  networks 
are  scarce  (in  the  simplest  case  of  a  few  faults  in  de 
Bruijn  networks  see  [17]).  Again  we  relied  on  com¬ 
puter  simulation.  It  seems  that  random  graphs  and 
pseudorandom  graphs,  particularly  Cayley  graphs 
arising  from  modular  groups,  are  quite  tolerant  of 
multiple  faults  of  links  and  nodes  sustaining  fast  in¬ 
tercommunication  inside  the  connected  com¬ 
ponents.  One  can  construct  large  families  of  pseu¬ 
dorandom  graphs  corresponding  to  finite  groups 
with  two  generators.  These  graphs  combine  the 
fault-tolerance  of  random  graphs  with  the  simplicity 
of  the  routing  diagram  in  a  fault-free  setting.  Practi¬ 
cal  realizations  of  these  graphs  in  a  parallel 
machine  will  be  discussed  elsewhere.  The  problem 
of  (AT))-  graphs  is  practically  quite  important  not 
only  for  the  speed  of  intercommunication  but  also 
in  view  of  the  enormous  amount  of  wiring  (cables) 
needed  if  a  wrong  graph  is  chosen.  Since  the 
amount  of  links  needed  to  assemble  N  nodes  into 
a  graph  of  degree  A  is  proportional  to  N  A  /2, 
one  realizes  that  a  popular  n-cube  configuration  (N 
=  2",  A  =  n,  D  =  n)  is  far  from  optimal  even  in  the 
class  of  graphs  with  a  similar  routing  (like  CCC), 
cf.  the  case  of  diameter  D  =  12  with  graphs  from 
Table  1  below. 

§1.  (A,D)-problem  and  Cayley 
graphs. 

For  efficient  parallel  processing  one  needs 
designs  of  microprocessor  networks  that  provide 
with  an  easy  data  routing  and  fast  intercommunica¬ 
tion. 

This  is  reduced  to  the  problem  of  constructing 
large  graphs  with  a  given  degree  and  diameter. 

Definition  1.  Let  G  -  (X£)  be  an  undirected 
graph  with  vertex  set  X  and  edge  set  E.  The  dis¬ 
tance  d(x,y)  between  two  vertices  x  and  y  is  the 
length  of  the  shortest  path  between  x  and  y  (the 
number  of  edges  in  this  path).  The  diameter  D  of 
C  is  D  *  max(x>y\  c  X  x  X  <K*.y)- 

The  degree  of  a  vertex  ts  the  number  of  ver¬ 
tices  adjacent  to  it  (distance  =  I),  and  the  degree  A 
of  G  is  the  maximum  degree  of  vertices  in  X. 


The  extremal  problem  of  degrees  and 
diameters  consists  of  finding  a  graph  with  the  maxi¬ 
mal  number  of  vertices,  denoted  n(A,D),  having 
given  degree  A  and  diameter  D. 

In  interconnection  networks  vertices  arc 
processors,  the  degree  is  the  number  of  links  inci¬ 
dent  at  a  processors  representing  a  vertex,  and  the 
diameter  is  the  number  of  links  necessary  for  broad¬ 
casting  from  everybody  to  everybody. 

An  upper  bound  on  n(A,D)  is  given  by  Moore: 

(1)  n(AJD)  £  (A  (A  -  1)D  -  2)  /  (A  -  2)  (A  >  3). 

The  proof  of  bound  0)  becomes  obvious  if  to  look 
at  (A  -  l)-tree. 

Apparently  for  D  >  2  Moore’s  bound  is  never 
achieved  (and  one  can  even  decrease  the  right  side 
of  (1)  by  2),  but  it  is  nearly  all  that  is  known  about 
the  upper  bound  on  n(A,D). 

Still,  one  expects  for  a  fixed  A  >  3,  that  the 
maximal  size  n  =  n(A,D)  of  a  graph  with  degree  A, 
diameter  D  is  expected  to  satisfy 

(2)  D  -  log  A  j  n. 

Exactly  this  kind  of  asymptotic  bound  (2)  was 
established  for  random  regular  graphs  of  degree  A 
in  [2];  see  review  in  [3].  Namely,  according  to  Bol- 
lobas-de  la  Vega  [2],  for  a  fixed  A  2  3,  for  any  e 
>  0,  the  diameter  D  of  "almost  any"  graph  of  de¬ 
gree  at  every  vertex  is  at  most 

(3)  [log  A  _  j  ((2  +  e)  n  log  n)]  +  1 

for  the  order  n.  This  means  that  for  any  e  >  0, 
the  proportion  of  all  A  -regular  graphs  of  order  n 
with  diameter  less  than  (3)  tends  to  zero  as  n  -» «>. 

It  is  relatively  easy  for  orders  n  in  the  mid¬ 
thousands  to  construct  A  -regular  random  graphs  of 
order  n  with  diameter  within  the  bounds  (2)  or 
(3).  Unfortunately,  no  regular  construction  of  such 
"random"  graphs  is  known  for  general  n,  and, 
moreover,  diameters  of  these  graphs,  though 
asymptotically  within  Moore’s  bound,  are  far  off 
the  expected  bounds. 

The  best  lower  bounds  on  n(A J>)  are  provided 
by  graphs  with  quite  a  regular  layout  that  are  usual¬ 
ly  compositions  of  graphs  of  small  size  and  have 
high  degree  of  symmetry.  For  some  of  the  recent 
tables  in  the  range  A  £  16  and  D  £  10  and 
various  methods  of  graph  composition  (various 
products,  etc.)  see  [6]  -  [9]. 

To  find  large  graphs  with  small  diameters  and 
to  simplify  the  routing  of  data,  we  look  at  graphs 
with  high  degree  of  symmetry.  A  natural  class  of 
such  graphs  are  Cayley  graphs  of  finite  croups  that 
automatically  possess  large  groups  of  automor¬ 
phisms. 

Definition  2.  Let  G  be  a  group,  and  C  =  [ci 

. cu)  be  a  set  of  its  generators.  A  Cayley  graph 

G  =  G  (G,C)  associated  with  G  and  C  has  as  its 


vertex  set  X  =  G,  and  two  vertices  gj,  g2  e  G  are 
adjacent  (connected  by  the  edge)  iff  gi  =  c  for 
c  e  C.  If  C  '  =  C,  the  Cayley  graph  G  is  un¬ 

directed. 

The  problem  of  minimal  diameter  of  Cayley 
graphs  can  be  reformulated  as  a  problem  of  finding 
the  generators  such  that  each  group  element  is  writ¬ 
ten  as  the  word  of  the  shortest  length. 

From  the  point  of  view  of  practical  realization 
of  interconnection  network  built  from 
TRANSPUTERS,  our  primary  interest  lies  with  de¬ 
gree  A  =  4,  when  there  are  4  adjacent  vertices  to 
every  one.  In  the  case  of  Cayley  graphs  this  cor¬ 
responds  to  finite  groups  G  with  two  generators  A, 
B  and  C  =  [A3,A^3  )  (i.e.  A  =  4). 

§2.  Cayley  graphs  of  factors  of 
modular  group  and  subgroups  of 
GL2(Fp). 

Cayley  graphs  of  classical  series  of  finite 
groups  turned  out  to  be  good  models  of  intercom¬ 
munication  networks.  Some  of  the  best  series  are 
given  by  Cayley  graphs  of  SL2(Fq). 

In  particular,  let  us  look  at  G  =  SL2(Z/pZ) 
with  C  =  (A3,A‘13'  with  A  and  B  that  are 
reductions  (mod  p)  of  two  generators  of  a  free  sub¬ 
group  of  a  full  modular  group  T(l)  =  SL2(Z)  (e.g. 
of  a  commutant  subgroups).  Varying  A  and  B 
we  get  Cayley  graphs  associated  with  this  G  = 
SL/>(Fn)  and  C  =  (AJ3,A'13‘l).  that  have  a  rela¬ 
tively  small  diameter  and,  simultaneously,  relative¬ 
ly  large  girth  [14]  (girth  is  the  length  of  the  shortest 
nontrivial  cycle). 

The  factors  of  modular  group  modulo  con¬ 
gruence  subgroups  seem  to  be  an  effective  way  to 
generate  graphs  that  locally  look  like  trees  (cf.  with 
a  free  groups).  The  routing  in  graphs  correspond¬ 
ing  to  the  action  of  this  group  can  be  expressed  in 
terms  of  the  Mobius  transformations,  and  after 
reductions  can  be  expressed  in  terms  of  the  general¬ 
ized  Euclidean  g.c.d.  algorithm.  For  series  of 
Cayley  graphs  associated  with  G  =  SL^CFp)  we 
have  established  the  following  asymptotic  upper 
bound  on  the  diameter  D 

(4)  D  -  log  lW2  Card(G) 

as  p  -»  oo  .  The  girth  of  these  Cayley  graphs  has 
the  lower  bound  of  the  same  order  of  magnitude. 

Though  this  bound  is  not  asymptotically 
equivalent  to  the  Moore’s  one,  we  have  the  first 
regular  construction  of  an  infinite  series  of  4- 
regular  graphs  with  diameters  asymptotically  better 
than  in  any  other  construction. 

Choosing  different  generators  of  G  we  can 
extend  our  construction  to  Cayley  graphs  of  ar¬ 
bitrary  degree  £  4.  Belter  diameters  than  those 
given  by  (4)  arise  in  Cayley  graphs  of  Borel  sub¬ 
groups  of  GL2(Fp)  and  Gl^Fq).  This  cor- 


responds  to  subgroups  of  upper  triangular  2x2 
matrices.  Classes  of  these  Cayley  graphs  and  other 
similar  graphs  (corresponding  c.g.  to  the  action  of  a 
group  on  a  finite  set)  have  distinct  advantages  in 
practical  implementations  as  models  of  interconnec¬ 
tion  networks.  First,  the  diameters  are  relatively 
small  for  large  sizes.  Most  important,  though,  is 
the  simplicity  of  construction  of  routing  tables. 
H.g.  one  requires  as  little  as  0(1)  memory  on  each 
processor-node  if  to  allow  for  processing  time 
0(log  2  n)  to  compute  locally  the  data  routing. 
(This  is  instead  of  0(n  2)  total  storage  necessary  for 
a  general  graph  of  size  n.)  On  the  other  hand,  the 
reliability  problem  (when  a  link  or  a  node  fails,  but 
the  network  continues  to  function)  favors  random 
graphs  with  less  rigid  routing  Cayley  graphs  of 
classical  groups  like  SL2(Fp)  look  Like  random 
graphs  and  have  similar  reliability.  In  connection 
with  fault-  tolerance  one  also  requires  high  connec¬ 
tivity  (A  -  connectivity)  that  we  found  in  our 
Cayley  graphs.  Another  important  requirement  for 
implementation  of  various  mathematical  problems 
is  the  existence  of  the  Hamiltonian  path,  imbedding 
of  various  grids  and  fast  realizations  of  particular 
permutations  common  in  many  applied  programs 
(row-to-row  exchange,  FFT  butterfly  operations,... 
etc).  Cayley  graphs  corresponding  to  Fq  are  well 
suited  for  realizations  of  these  requirements,  though 
the  array  sizes  best  suited  for  data  routing  are  ex¬ 
pressed  in  mod  p  arithmetic.  The  choice  of  q  =  2  m 
seems  to  be  best  suited  for  arrays  of  sizes  propor¬ 
tional  to  the  powers  of  2.  The  permutation  rout¬ 
ing,  when  the  permutation  is  known  in  advance,  for 


Borel  subgroups  of  GL2(F q)  is  quite  similar  to  that 
of  CCC  networks  [12]  ancf  requires  at  worst  0(log 
n)  steps  to  simulate  the  Benes  network  of  size  n. 
The  solution  to  the  problem  of  the  best  realization 
of  an  arbitrary  permutation  on  a  random  graph  or 
on  a  Cayley  graph  of  SI-2(Fp)  is  not  known  to  us, 
but  some  optimal  routing  *strategies  for  rayley 
graphs  of  Borel  subgroups  are  presented  in  Chapter 
5.  Because  of  this  and  because  Cayley  graphs  of 
Borel  subgroups  give  the  best  diameters  in 
thousands-node  range,  we  recommend  these  graphs 
for  simulation  and  development.  Among  favorites 
of  two  of  the  authors  (C  &  C)  is  15,657-node  graph 
with  A  =  4  and  D  =  10.  One  should  expect, 
though,  for  these  A  and  D  the  existence  of  graphs 
with  over  100,000-nodes.  They  should  make  an  in¬ 
teresting  massively  parallel  machine. 

We  conclude  this  chapter  with  a  table  of  large 
graphs  of  degree  A  =  4  with  a  given  diameter  5. 
In  this  table  our  best  examples  are  Cayley  graphs 
of  the  Borel  subgroups  G  of  GL2(Fp)  with  two 
generators  A3  (chosen  according  to  the  conjugacy 
classes  of  G  to  give  the  minimal  diameter).  The 
subgroups  G  depend  on  a  prime  p  and  a 
parameter  a  e  Fp  \  [0,1 }  and  is  defined  as 


*  y 

G  =  {  (  )  :  x  =  am  mod  p]  c  GL2(Fp). 

0  I 


Sizes  of  graphs  of  degree  A  =  4  of  a  given  diameter  D 


diameter 

Moore's  bound 

Known 
graphs  (1987) 

New  Cayley  *) 
graphs 

D  =  3 

51 

40 

36 

D  =  4 

159 

95 

90 

D  =  5 

483 

364 

320 

D  =  6 

1,455 

731 

730.. 

D  =  7 

4,371 

856 

1 .08 1  ^ 

D  =  8 

13,119 

1,872 

2,943* 

D  =  9 

39,363 

4,352 

7.439*\ 

D=  10 

118,095 

13,056 

15,657* 

D  =  11 

354,291 

38.764* 

D  =  12 

1,062,879 

82,901* 
140, 607^ 

D  =  13 

3,118,643 

*)  Subgroups  of  GL^fZ/pZ). 

2  Borel  subgroup  u  with  p  =  47,  a  =  2. 

J  Borel  subgroup  G  with  p  =  109,  a  =  7. 

:  Borel  subgroup  G  with  p  =  1 73,  a  =  24 

^  Borel  subgroup  G  with  p  =  307,  a  =  l2 

Borel  subgroup  G  with  p  =  881,  a  =  3^° 

Borel  subgroup  G  with  p  =  901,  a  =  2. 

’’  Borel  subgroup  G  with  p  =  919,  a  =  2. 


Table  1. 


Remark.  One  realizes  that  ihcsc  G  arc  re¬ 
lated  to  de  Bruijn-like  networks  (e.g.  put  a  =  2). 
They  can  be  also  considered  as  a  generalization  of 
CCC  [12-13]. 

Cayley  graphs  with  G  =  SL^fFp)  give  less 
dense  sequence  of  graph  sizes.  The  best  diameters 
(for  special  choices  of  A,  B)  are:  G  =  SL-,(F17), 
Card(G)  =  2.184,  D  =  8;  G  =  SLo(FJ7).  Carc3(G)  = 
4.896.  D  =  9.  G  =  Sl^F,,).  Card(G)  =  12,144,  D 
=  10. 


§3.  Expansion  coefficients  and 
diameter. 

In  the  study  of  fast  data  routing  one  en¬ 
counters,  in  addition  to  (AJD)-problem,  similar  and 
loosely  related  subject  of  expanders,  diffusors  and 
superconcentrators.  To  be  precise,  let  us  look  at 
the  graph  G  =  (X.E),  and  for  any  subset  A  c  X  we 
define  the  boundary  of  A  as  3A  =  (x  e  X  :  d(x,A) 
=  1}.  Expanders  have  the  property  that  every  A  c 
X  has  a  large  boundary:  one  calls  a  graph  G  = 
(X.E)  an  (nAc)-expander,  if  G  is  a  A  -  regular 
graph  (G  has  degree  A  at  every  vertex)  of  n  ver¬ 
tices,  and  for  every  A  c  X,  Card(A)  <  n/2,  the 
boundary  3A  has  at  least  c  -  Card(A)  elements. 
The  constant  c  >  0  is  called  the  expansion  coeffi¬ 
cient. 

The  connection  between  the  diameter  problem 
and  expanders/superconcentrators  is  based  on  the 
spectral  properties  of  graph.  Let  us  denote  for  the 
graph  G  =  (XJE),  by  the  adjacency  mamtrix 
of  G  :  AG  =  (axy)  x  y  E  x.  where  axy  =  ’  if  xy  e 
E  and  axy  =  0  otherwise.  If  G  is  a  regular  graph 
of  degree  A,  A  is  the  largest  (in  absolute  value) 
eigenvalue  of  A q.  We  denote,  as  usual,  by  Xi  = 
\l(G)  the  second  largest  (in  the  absolute  value) 
eigenvalue  of  A(j.  This  eigenvalue  is  usually 
called  the  diffusion  coefficient  The  relationship  be¬ 
tween  expansion  and  diffusion  coefficients  is  the 
folowing  (see  [18]).  A  A  -regular  graph  G  of 
size  n  is  an  (n  Ac>expander  with 

c  =  (1-M/  A)/2, 

also  one  always  has  Xi  S  A  -  c2 /<-»-►  2  c2), 
see  [16],  [18].  It  seems  intuitively  dear  that  graphs 
with  large  expansion  coefficients  should  have  a 
small  diameter.  This  had  been  substantiated  to  an 
extent  with  the  following  geometric  result  from 
[18]: 

For  a  regular  graph  of  degree  A  with  a  given 
X-i,  the  diameter  is  bounded  in  terms  of  the  sue  of 
the  graph  as  follows: 

(5)  D  S  2  [V2A/(A-Xj)  log  2  n). 


The  construction  of  graphs  with  the  smallest 
diffusion  coefficient  is  important  in  the  construction 
of  supcrconcentrators.  However,  the  inequality  (5)' 
on  D  in  terms  of  Xi  does  not  provide  a  sharp 
bound  at  all;  one  expects  log  n  and  not  log2  n. 
We  conjecture,  however,  that  the  relationship  is 
sharper 

Conjecture.  For  a  A  -regular  graph  G  of  size 
n,  the  relationship  between  D  and  Xj  is  the  follow¬ 
ing 

D  £  (log  Xi/2  n  )  /  2  +  o  (log  A  .  j  n). 


This  relationship,  if  true,  should  be  asymptoti¬ 
cally  (!)  the  best  possible.  Indeed,  there  is  always  a 
lower  bound  on  Xi:  Xj  2  2  V  A  -  1  as  n  -*  «  . 
Graphs  for  which  X|  £  2  >/  A  -  1  were  called  in 
[16]  Ramanujan  graphs.  Examples  of  authors  [16] 
(1985-86)  of  Ramanujan  graphs  included  graphs 
arising  from  quotients  of  quaternion  Fuchsian 
groups  modulo  congruence  subgroups  of  level  p 
(connected  with  quadratic  forms  in  four  variables). 
Unfortunately,  for  moderate  primes  (p  <  10,000) 
none  of  these  Ramanujan  graphs  have  a  small 
diameter.  It  seems  that  diameters  of  Ramamujan 
graphs  are  worse  than  diameters  of  random  regular 
graphs  of  the  same  size.  Nevertheless,  some  of  the 
graphs  of  relatively  small  diameter  can  be 
Ramanujan  graphs.  Among  these  graphs  are 
Cayley  graphs  of  SL2(FB)  for  a  large  set  of  p’s 
(numerical  experiments,  cr.  [15],  show,  though,  that 
not  for  all  p’s  and  not  for  all  generators  A  and  B 
these  graphs  are  Ramanujan  ones). 

In  general,  having  a  small  diameter  has  noth¬ 
ing  to  do  with  having  small  Xj,  as  signified  by  a 
large  number  of  de  Brujin-like  graphs  having  rela¬ 
tively  small  diameter  and  large  X  j.  The  main  in¬ 
terest  in  Xj,  in  addition  to  its  importance  in  super- 
concentrators,  is  purely  geometric.  For  various 
Cayley  graphs  G  that  arise  from  algebraic  curves 
over  Fn  for  q  =  pn  (as  n  -»  <»  )  Xi  and  other  non¬ 
trivial  Eigenvalues  of  A^  are  closely  connected  to 
eigenvalues  of  Frobenius  x  — »  xp  acting  on  this 
curve.  In  the  case  of  algebraic  curves  over  finite 
(and  local)  fields  that  are  uniform ized  by  special 
groups  of  Mobius  transformations  (Mumford’s 
groups  and  uniform izations),  Ihara’s  theory  [19]  al¬ 
lows  to  express  the  distance  function  on  Cayley 
graphs  corresponding  to  factors  of  arithmetic 
groups  in  terms  of  eigenvalues  of  Hecke 
(Frobenius)  operators.  This  often  gives  "explicit- 
formulas  for  Xi,  and  make  their  study  very  interest¬ 
ing.  The  growth  of  Xj  (and  D)  in  this  case  is  deter¬ 
mined  by  the  sizes  of  eigenvalues  of  Frobenius,  Le. 
ultimately  by  Weil-Deligne  bounds  in  Weil- 
Ramanujan  problems  for  ^  -function  of  appropriate 
curves  and  algebraic  varieties. 


§4.  Pseudorandom  and  other 
Interesting  classes  of  graphs. 

One  can  try  10  combine  the  benefits  of 
regularity  of  layout  of  Cayley  graphs  with  fault- 
tolerance  and  good  extremality  properties  of  ran¬ 
dom  graphs  by  looking  at  "pseudorandom'1  graphs. 
To  this  category  of  graph  belong  graphs  generated 
by  several  random  permutations.  Alternatively,  one 
can  consider  as  a  pseudorandom  a  Cayley  graph  for 
an  appropriate  finite  group  G  with  a  random 
choice  of  generators  A  and  B.  In  fact,  some 
Cayley  graphs  for  SL^fFp)  can  be  considered  as 
such. 

If  a  graph  has  a  Hamiltonian  cycle  (and  we 
are  interested  only  in  this  class  of  graphs),  one  can 
consider  its  vertices  as  lying  on  a  circle  of  n  ver¬ 
tices  with  i-th  vertex  connected  to  i  +  1  (mod  n). 
An  easy  way  to  generate  pseudorandom  graphs  is 
to  choose  a  random  permutation  n  of  (l,...n)  and 
connect  n(i)  with  n(i  ±  1)  (mod  n).  These  graphs 
can  be  described  invariantly  as  those,  having  two 
Hamiltonian  paths  with  non-overlapping  links. 
Choosing  permutation  it  is  various  ways  one  can 
decrease  diameter  and  increase  girth  of  such  pseu¬ 
dorandom  graphs.  Also,  expansion  coefficients 
match  those  of  Ramanujan  graphs  for  these  pseudo¬ 
random  ones.  This  construction  is  very  promising 
for  practical  implementation.  We  performed  com¬ 
puter  experiments  for  data  routing  on  these  pseudo¬ 
random  graphs  that  showed  a  satisfactory  perfor¬ 
mance.  (For  example,  the  routing  according  to  a 
random  permutation  of  n  nodes  for  n  in  the  range 
of  200  -  1,000  required  in  average  the  routing  limes 
of  D  +  1.  This  was  achived  with  an  obvious  rout¬ 
ing  strategy  "send  data  along  the  shortest  path  from 
the  addresee  to  the  addressant".  As  a  matter  of 
fact,  this  obvious  strategy  performs  much  better  on 
a  random  or  a  pseudorandom  graphs  than  on  a 
regular  layout  graph  with  a  minimal  diameter.  One 
can  see  why  this  happens  by  looking  at  a  local  A  - 
1  -  tree,  and  sending  messages  from  one  branch  to 
another  across  the  root  of  the  tree.  Combining  ob¬ 
vious  routing  strategy  for  random  permutations 
with  multi-phase  oblivious  strategy  of  Chapter  5 
one  can  arrive  at  an  efficient  randomized  routing 
strategy  for  an  arbitrary  permutation.) 

Some  of  the  graphs  with  Hamiltonian  paths, 
like  above,  are  called  chordal  graphs.  This  name  is 
usually  reserved  not  for  pseudorandom  gT3phs.  but 
for  those  where  the  chordal  connections  between  n 
nodes  on  the  circle  is  determined  in  a  regular 
fashion  as  a  simple  number-theoretic  function.  A 
precise  definition  can  be  the  following,  see  i  2 0 ; . 

Definition  of  Generalized  Chordal  Kings. 
The  graph  G  =  (XJE)  is  a  generalized  chordal  nng 
if  X  =  (l,...,n)  mod  n,  and  there  are  divisors  q  of 
n  (q  *  n)  such  that  vertex  i  is  joined  to  vertex  j 


iff  the  vertex  i  +  q  (mod  n)  is  joined  to  j  +  q  (mod 

n). 

If  all  edges  (i,  i  +  1)  appear  in  the  graph  (i.c. 
q  =  1),  then  this  graph  is  called  the  chordal  ring. 

For  small  n  some  of  the  best  (A, Digraphs 
arc,  in  fact,  chordal  rings  [20],  [9].  It  is  an  interest¬ 
ing  number-  theoretic  problem  how  to  determine  ex¬ 
plicitly  the  diameter  of  a  generalized  chordal  ring 
given  the  divisors  q  of  n  and  the  lengths  of 
chords.  Here  by  the  length  of  a  chord  in  a  general¬ 
ized  chordal  ring  one  understands  such  an  integer 
n  that  nodes  i  and  i  +  n  are  connected  for  a  given 
i  (mod  n/q). 

A  word  of  caution:  for  a  large  n  generalized 
chordal  rings  are  not  the  best  (AJD)-graphs  for  a 
fixed  A. 

§5.  Data  routing  on  regular  graphs. 

In  practical  realizations  of  interconnection  net¬ 
works  as  models  of  parallel  computers,  one  of  the 
most  important  problems  is  that  of  fast  realization 
of  data  routing,  particularly  that  of  permutation  rout¬ 
ing.  The  routing  strategies  are  usually  divided  into 
local  and  global,  or  into  oblivious  and  nonoblivious 
[20  -  23].  A  startegy  is  called  an  oblivious  one, 
when  the  route  of  any  packet  depends  only  on  the 
origin  and  destination  of  the  packet  Oblivious 
strategies  are  not  quite  local  baause  the  time  to 
send  a  packet  along  an  edge  can  be  determined  by 
a  non-  local  decision.  Whenever  the  permutation  is 
known  in  advance,  the  global  strategy  can  be  ad¬ 
vantageous.  One  can  try  to  simulate  in  this  case  a 
Benes-type  network  to  achieve  as  the  global  worst 
case  time  OOog  n)  for  a  routing  on  a  network  of  n 
nodes.  This  routing  time  can  be  achieved  after  a 
considerable  effort  on  precompilation,  but  can  be 
proved  for  many  networks  of  classical  computer 
science.  These  include  d-dimensional  cubes,  shuf¬ 
fle  eschange,  CCC  -  network  and  its  variations,  [21 
-  24].  All  regular  graphs  described  in  Chapter  2 
also  belong  to  this  category,  and,  in  particular,  on 
regular  graphs  associated  with  the  Borel  subgroups 
of  GL^lrp)  of  size  n  one  can  emulate  a  Benes  net¬ 
work  ut  Curne  0(log  n).  Another  classical  routing 
strategy  is  based  on  Batcher’s  OOog  2  n)  -  sorting 
network  algorithm  [24].  This  strategy  can  be  imple¬ 
mented  on  classical  networks  and  on  regular  graphs 
of  Chapter  2.  In  practical  implementation  of 
programming  on  massively  parallel  machines  the 
most  attention  is  attracted  to  oblivious  strategies 
that  are  easy  to  implement  on  any  switching  net¬ 
work.  There  exist  two  negative  results  concerning 
these  strategies.  The  first  of  them  by  Borodin  and 
Hopcroft  [24]  states  that  in  any  network  of  n  nodes 
of  degree  A  the  time  required  in  the  worst  case  by 
any  oblivious  routing  strategy  is  fl(Vn/A  irz).  We 
can  supplement  this  negative  result  by  the  following 

Lemma.  In  any  graph  of  n  nodes  of  degree  A, 
any  routing  strategy,  where  the  next  step  in  the 
route  of  the  packet  depends  only  on  its  present  loca- 


tion  and  its  final  destination,  requires  in  the  worst 
case  the  time  of  ft(n/(A  (A  +  1))). 

An  easy  proof  of  this  lemma  also  shows  that 
for  any  such  local  strategy  there  is  a  permutation 
and  a  "hot"  node  such  that  a  given  routing  strategy 
requires  at  least  n/(A  +  1)  messages  to  pass  through 
this  node. 

The  relevance  of  these  negative  results  for 
practical  realization  of  mathematical  programming 
on  massively  parallel  computers  with  simple 
oblivious  strategies  of  routing  is  unclear.  Generic 
permutations  can  be  realized  on  classical  networks 
without  significant  congestion.  However,  there  are 
explicit  permutations,  not  artificially  constructed, 
that  can  clog  the  shuffle  exchange  and  other  similar 
switching  networks.  These  permutations  are  con¬ 
nected  with  bit-reversal  mappings  appearing  in  the 
Coolcy-Tuckey  FFT  algorithms  (with  the  radices  in 
the  FFT  connected  with  parameters  of  the  shuffle 
networks).  On  the  other  hand,  the  fact  that  almost 
all  permutations  do  not  create  large  contentions  on 
classical  networks  gave  rise  in  [21-23]  to  two- 
phase  randomized  oblivious  strategy  of  routing.  In 
this  strategy  on  the  first  phase  each  packet  is  sent 
to  a  random  destination  and  in  the  second  phase  it 
is  forwarded  towards  the  desired  destination,  thus 
realizing  the  routing  of  two  random  permutations. 
This  strategy  was  shown  in  [23]  to  be  nearly  op¬ 
timal  for  shuffle  networks,  because,  according  to 
[23],  any  (deterministic  or  randomized)  oblivious 
routing  strategy  either  takes  the  time  fl(n  e)  for 
some  e  >  0,  or  makes  packets  to  travel  at  least 
twice  the  diameter. 

The  variations  in  this  multi-phase  strategy  can 
be  adopted  for  any  of  the  regular  graphs  described 
in  Chapters  2  and  4.  These  strategies  have  a  dis¬ 
tinct  advantage  of  testability  in  the  sense  that  the 
distribution  of  delays  is  independent  of  the  permuta¬ 
tion  to  be  routed  (since,  in  a  sense,  we  are  testing 
only  routing  of  a  random  permutation).  Thus  the 
distribution  can  be  efficiently  evaluated  by  a  com¬ 
puter  simulation.  This  was  done  for  the  best  (4,D)  - 
graphs  of  sizes  up  to  3,000.  Similar  simulation  was 
conducted  for  random  graphs  (of  sizes  up  to  1,000 
and  pseudorandom  graphs  of  sizes  up  to  2,000). 

More  complicated  randomized  oblivious  rout¬ 
ing  strategies  are  necessary  if  sizes  of  buffers  per 
node  are  limited.  A  variation  of  two-phase  strategy 
easily  provides  with  Oflog  n)  buffers  in  routing  for 
regular  graphs  associated  with  Borel  subgroups  of 
GLofFp)  of  size  n.  These  startegies  are,  in  par¬ 
ticular,  testable  by  computer  experiments.  Improve¬ 
ments  in  these  randomized  strategies  like  in  [21]  or 
[25],  provide,  for  the  same  class  of  graphs,  with 
constant  buffering  of  0(1)  and  with  the  worst  rout¬ 
ing  time  of  0(log  n).  The  problem  of  optimal  con¬ 
stant  in  the  worst  routing  time  is  still  an  open  one. 
Another  problem  is  the  testability  of  the  best  rout¬ 
ing  startegies  for  random  graphs.  Limited  buffer¬ 
ing  as  n  -+  «•  creates  also  a  possibility  of  a  dead¬ 
lock  (infinite  propagation  time);  this  never  occurs  if 
buffers  are  as  large  as  a  diameter. 


References 


[1]  B.  Elspes,  Topological  constraints  on 
interconnection  limited  logic,  in  "Proceedings  5th 
symposium  on  switching  circuit  theory  and  logical 
design,  v.  5-164,  pp.  133-  197,  IEEE,  New  York, 
1964. 

[2]  B.  Bollobas,  W.F.  de  la  Vega,  The  diameter 
of  random  regular  graphs,  Combinatorica  2  (1982), 
125-123. 

[3]  B .  Bollobas,  Random  Graphs.  Academic 
Press,  New  York,  1985. 

[4]  N.C.  Wormald,  Generating  random  regular 
graphs.  J.  of  Algorithms  5  (1984),  247-280. 

[5]  R.I.  Storwick,  Improved  construction 
techniques  for  (d,k)-  graphs.  IEEE  Trans.  Comput 
C- 19,  (1970),  1214-1216. 

[6]  W.E.  Leland,  R.  Finkel,  L.  Qiao,  NIL 
Solomon,  L.  Urh,  High  density  graphs  for 
processor  interconnection.  Inform.  Process.  Lett.  12 
(1981),  117-120. 

[7]  G.  Memmi,  Y.  Raillard,  Some  new  results 
about  the  (djk)-  graph  problem,  IEEE  Trans. 
Comput.,  C-31  (1972),  784-791 

[8]  J.-C.  Bermond,  C.  Delorme,  J.-J.  Quisquater, 
Tables  of  large  graphs  with  given  degree  and 
diameter.  Inform  Process.  Lett.  15  (1982),  10-13. 

[9]  J.-C.  Bermond,  C.  Delorme,  J.-J.  Quisquater, 
Strategies  for  interconnection  networks:  some 
methods  from  graph  theory,  J.  Parallel  and 
Distributed  Computing  3  (1986),  433-449. 

[10]  A.  Ralston,  de  Bruijn  sequences.  A  model 
example  of  the  interaction  of  discrete  mathematics 
and  the  computer  science.  Math.  Mag.  55  (1982), 
131-143. 

[11]  M.  Imase,  M.  Stoch,  A  design  of  directed 
graphs  with  minimum  diameter.  IEEE  Trans. 
Comput.  C-32  (1983),  782-784. 

[12]  F.  Preparata,  J.  Vuillemin,  The 
cube-connected  cycles:  a  versatile  network  for 
parallel  computation.  Commun.  Ass.  Comput 
Mach.  24  (1981), 

[13]  G£.  Carlson,  J£.  Cruthirds,  H.B.  Sexton, 
C.G.  Wright  Interconnection  networks  based  on  a 
generalization  of  cube-  connected  cycles.  IEEE 
Trans.  Comput  C-34  (1985),  769-  777. 

[14]  G.A.  Margulis,  Explicit  construction  of 
graphs  without  short  cycles  and  low  density  codes. 
Combinatorica  2  (1981),  71-  78 

[15]  N.W.  Buck,  Expanders  and  diffusers.  SIAM 
J.  Alg.  Dise.  Math.  7  (1986),  282-304. 

[16]  A.  Lubotzky,  R.  Phillips,  P.  Sarhak, 
Ramanujan  graph.  Preprint  1986. 

[17]  A.-N.  Esfahanian,  SJ-.  Halrimi, 

Fault-tolerent  routing  in  de  Bruijn  communication 
networks.  IEEE  Trans.  Comput  C-34  (1985), 
777-788. 

[18]  N.  Alon,  V.D.  Milman,  Xj,  Isoperimetric 
inequalities  for  graphs,  and  superconcentrators.  J. 
Comb.  Theory.  38B  (1985).  73-  88. 


[19]  Y.  Ihara,  Discrete  subgroups  of  PL(2,kn). 
Proc.  Symp.  Pure  Math.,  v.  9, 272-278,  A.M.ST, 
1966. 

[20]  K.W.  Doty,  New  design  for  dense  processor 
interconnection  network.  IEEE  Trans.  Comput 
C-33  (1984),  447- 450. 

[21]  E.  Upfal,  Efficient  schemes  for  parallel 
communication,  J.  Assoc.  Comp.  Mach.,  31  (1984), 
507-517. 

[22]  L.G.  Valiant,  G.S.  Brebner,  Universal 
schemes  for  parallel  communication,  Proc.  Annual 
ACM  Symp.  Theory  Computing,  1981, 263-277. 

[23]  L.G.  Valiant  Optimality  of  a  two-phase 
strategy  for  routing  in  interconnection  networks, 
IEEE  Trans.  Comput  C-32  (1983),  861-863. 

[24]  A.  Borodin,  J.E.  Hopcroft,  Routing, 
merging,  and  sorting  on  parallel  models  of 
computation,  J.  Computer  System  Sci.,  30  (1985), 
130-145. 

[25]  N.  Pippinger,  Parallel  communication  with 
limited  buffers,  Pi roc.  Annual  ACM  Symp.  Theory 
Computing,  1984, 127-136. 


CHUDNOVSKY,  Gregory  V. 

Department  of  Mathematics 
Columbia  University 
New  York,  New  York  10027 

Born:  April  17,  1952  in  Kiev,  USSR.  Naturalized  in  1983. 

Education: 

Kiev  State  University,  1969-1974,  Diploma  in  Mathematics  (mention  "summa  cum 
laude"). 

Institute  of  Mathematics,  Ukrainian  Academy  of  Sciences,  Kiev,  June  1975,  PhD. 

Positions  held: 

Research  Fellow,  Kiev  State  University,  1974-1976. 

Visiting  Professor,  Institute  des  Hautes  Etudes  Scientifiques,  Bur es-sur -Yvette,  France, 
September,  1977 -February,  1978;  June,  1978-August,  1978. 

Maitre  de  Conference,  University  of  Paris  VI,  October,  1977-  February,  1978. 

Currently  (since  February,  1978):  Senior  Research  Scientist  in  the  Department  of 
Mathematics,  Columbia  University,  New  York. 

Maitre  de  Recherche,  Centre  National  de  la  Recherche  Scientifique,  Paris,  March, 
1979-September,  1979;  May,  1980-  September,  1980;  July,  1981 -October,  1981. 

Fields  of  Mathematical  Interest: 

Number  Theory:  Analytic  number  theory,  Diophantine  approximations  and  transcen¬ 
dence  theory.  Mathematical  Physics:  Non-linear  equations,  quantum  and  classical 
field  theories.  Computer  Science:  Computer  algebra  and  complexity. 

Awards  and  Honors: 

Prize  of  the  Moscow  Mathematical  Society,  1970. 

Prix  Peccot-Vimont,  1979,  France, 

John  Simon  Guggenheim  Fellowship,  1980. 

Mac  Arthur  Prize  Fellowship,  1981-1986, 

Doctor  of  Science  honoris  causa,  Bard  College,  New  York,  1981. 

Addresses: 

Invited  lecture,  Colloquium  of  Paris  Universities,  November,  1977. 

Invited  address,  International  Congress  of  Mathematicians,  Helsinki,  Finland,  August, 
1978. 

Cours  Peccot  at  College  de  France,  1979. 


Invited  lectures,  Oberwolfach  Conferences  on  Diophantine  Approximations,  1977, 
1979,  1981,  1983. 

Cours  of  lectures,  Number  Theory  Year  at  University  of  Maryland,  College  Park, 
April,  1978. 

Invited  address  at  the  Dutch  Mathematical  Congress,  Eindhoven,  Holland,  April,  1979. 

Invited  one-hour  address  at  the  87th  Annual  Meeting  of  the  American  Mathematical 
Society,  San  Francisco,  January,  1981. 

Invited  address  at  N.Y.U.  Conference  "Computer  Algebra  as  a  Tool  for  Research  in 
Mathematics  and  Physics",  April,  1984. 

Invited  address  at  the  Conference  "Computers  and  Mathematics" ,  Stanford  University, 
August,  1986. 

Invited  talk  at  the  Conference  "Elliptic  Curves  and  Modular  Forms  in  Algebraic  Topol¬ 
ogy",  Princeton  IAS,  September,  1986. 

Invited  talk  at  the  Ramanujan  Centenary  Conference,  University  of  Illinois,  Urbana, 
June,  1987. 

Invited  lectures  at  A  M.S.  Summer  School  on  Theta  Functions,  Bowdoin  College, 
Maine,  July,  1987. 

Research  Awards: 

1978-  :  National  Science  Foundation,  Mathematics  Section. 

1978-1982  :  Office  of  Naval  Research,  Mathematics  Division. 

1980-  :  Air  Force  Office  of  Scientific  Research. 

1986-  :  OCREAE  Program,  N.S A. 

1987-  :  DARPA. 

Consulting: 

IBM  T.  J.  Watson  Research  Center,  1982- 
Hudson  Institute,  1984- 

Chairman  (with  D.  V.  Chudnovsky,  H.  Cohn,  M.  B.  Nathanson)  of  New  York  Number 
Theory  Seminar:  1981- 

Member  of  the  Organizing  Committees: 

Computer  Algebra  as  a  Tool  for  Research  in  Mathematics  and  Physics;  N.Y.U. ,  April, 
1984; 

Computers  and  Mathematics,  Stanford.  August,  1986; 

AMS.  Summer  School  on  Theta  Functions.  Bowdoin  College,  July,  1987. 


CHUDNOVSKY,  David  V. 

Department  of  Mathematics 
Columbia  University 
New  York,  New  York  10027 

Born:  January  22,  1947  in  Kiev,  USSR.  Naturalized  in  1983. 

Education: 

Kiev  State  University,  1964-1969,  Diploma  in  Mathematics  ( mention  "summa  cum 
laude"). 

Post  graduate  fellow,  Institute  of  Mathematics,  Ukrainian  Academy  of  Sciences,  Kiev, 
1969-1972.  June  1972,  PhD. 

Positions  held: 

Research  Fellow  and  Senior  Research  Fellow,  Institute  of  Mechanics,  Ukrainian 
Academy  of  Sciences.  1969-1976. 

Research  Fellow,  Centre  de  Mathematiques,  Ecole  Polytechnique,  October,  1977- 
February,  1978;  June,  1978-September,  1978. 

Currently  ( since  February,  1978):  Senior  Research  Scientist  in  the  Department  of 
Mathematics,  Columbia  University,  New  York. 

Visiting  Professor,  Service  de  Physique  Theorique,  Center  of  Nuclear  Energy-Saclay, 
Gif-sur  Yvette,  France,  March,  1979-  September,  1979;  May,  1980-September,  1980. 

Charge  de  Recherche,  Centre  National  de  la  Recherche  Scientifique,  Paris,  July,  1981- 
November,  1981. 

Fields  of  Mathematical  Interest: 

Theoretical  Mathematics:  Number  Theory,  General  Topology,  partial  differential  equa¬ 
tions,  Hamiltonian  systems.  Mathematical  Physics:  Field  theories,  quantum  systems. 
Computer  Science:  Computer  algebra  and  complexity. 

Awards  and  Honors: 

John  Simon  Guggenheim  Fellowship.  1980. 

Addresses: 

Course  of  invited  lectures,  Cargese  Summer  School  of  Mathematical  Physics;  Cargese, 
France,  June-July,  1979. 

Invited  lecture,  International  Colloqu.m  <n  Complex  Analysis  and  Relativistic  Quan¬ 
tum  Theory,  Centre  de  Physique  les  Huiur.cs,  France,  September,  1979. 

Address  at  the  International  Meeting  on  "Nonlinear  Evolution  Equation  and  Dynami¬ 
cal  Systems",  Lecce,  Italy,  June,  1979. 


Colloquium  lecture  at  the  Physical  Institute  of  Utrecht  University,  Holland,  April,  1979. 

Course  of  lectures  at  International  Conference  on  Theoretical  Physics,  Ecole  Normale 
Superieure,  Paris,  August,  1980. 

Address  to  the  XI  International  Conference  in  Group  Theoretical  Methods  in  Physics, 
Istanbul,  August.  1982. 

Invited  address  at  N.Y.U.  Conference  "Computer  Algebra  as  a  Tool  for  Research  in 
Mathematics  and  Physics",  April,  1984. 

Invited  address  at  the  Conference  "Computers  and  Mathematics" ,  Stanford  University, 
August,  1986. 

Invited  talk  at  the  Conference  "Elliptic  Curves  and  Modular  Forms  in  Algebraic  Topol¬ 
ogy",  Princeton  IAS,  September,  1986. 

Invited  talk  at  the  Ramanujan  Centenary  Conference,  University  of  Illinois,  Urbana, 
June,  1987. 

Invited  lectures  at  A.M.S.  Summer  School  on  Theta  Functions,  Bowdoin  College, 
Maine,  July,  1987. 

Research  Awards: 

1978-  :  National  Science  Foundation,  Mathematics  Section. 

1978-1982:  Office  of  Naval  Research,  Mathematics  Division. 

1980-  :  Air  Force  Office  of  Scientific  Research. 

1986-  :  OCREAE  Program,  N.S A. 

1987-  :  DARPA. 

Consulting: 

IBM  T.  J.  Watson  Research  Center,  1982-  . 

Hudson  Institute,  1984- 

Chairman  (with  D.  V.  Chudnovsky,  H.  Cohn,  M.  B.  Nathanson)  of  New  York  Number 
Theory  Seminar:  1981- 

Member  of  the  Committee  on  Translations,  American  Mathematical  Society  &  ASL, 
1984-1987. 

Co-chairman: 

Computer  Algebra  as  a  Tool  for  Research  in  Mathematics  and  Physics;  N.Y.U. ,  April, 
1984; 

Computers  and  Mathematics,  Stanford,  August,  1986; 

Member  of  the  Organizing  Committee: 

A.M.S.  Summer  School  on  Theta  Functions,  Bowdoin  College,  July,  1987. 


